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FACTORS IN THE VARIABILITY OF RESPONSE 
IN ENUMERATIVE STUDIES 


By Guapys L. PALMER 
Industrial Research Department, University of Pennsylvania 


error that arises in sampling is well recognized, but other sources, 
which may be more important, are often overlooked, even though they 
are present as much in complete coverages as in samples. For instance, 
there is a variability of response to be reckoned with even within the 
framework of well defined schedule concepts and well trained enumer- 
ators. Poorly worded schedule questions and inadequate training of 
enumerators not only intensify this variability, but are the cause of un- 
reliability in the sense of biases. However, in the absence of a standard 
operation, which by definition would yield no error and would enable 
one to differentiate correct from incorrect results, and thus to recognize 
biases, it is perhaps better to speak of variability of response merely as 
a source of difference between the results of two or more coverages of 
the same universe, rather than to speak of a bias in one or both cover- 
ages. The purpose of this paper is to present some results on vari- 
ability of response.’ 

Relatively little is known about the degree of variability of response 
for different types of information or its relationship to other conditions 
of enumeration because the opportunity to measure it is limited to 
studies in which two sets of interviews requesting the same information 
for the same persons for the same period of time have been made. Rel- 
atively high costs have discouraged this kind of enumerative study 
while unfortunately, at the same time, differences in the results from 
applications of parallel techniques to the same universe for measuring 
the same phenomenon, without adequate explanation of the causes of 
these differences, have led to public confusion. As a consequence, doubt 
has been cast on the results of enumerative studies, yet these are the 


{~~ ARE SEVERAL sources of error in enumerative studies. The 


1 The writer is indebted to Dr. W. Edwards Deming and Dr. A. Ross Eckler of the Bureau of the 
Census for helpful suggestions with regard to this analysis. 
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only, or the major, source of information for many types of im- 
portant social and economic data.’ 

An opportunity to study variability of response is found in two sets 
of interviews of 8519 persons in 2686 Philadelphia households obtained 
by two groups of enumerators who had been subject to the same train- 
ing program. The purpose of this analysis is to determine the relation of 
certain conditions of enumeration to variation in individual response 
and the effect of variation in individual response upon the summarized 
results. 

Four important social and economic characteristics were selected for 
the analysis of variability of response: marital status, age, employment 
status, and education. These characteristics were related to the follow- 
ing conditions of enumeration: 

1. The proportion of Negro or foreign-born residents, or both com- 
bined. 

2. The number of days of experience of enumerators at the time of the 
first field visit. 

3. The lapse of time between the two interviews. 

4. The personnel ratings of the original and revisit enumerators.® 


Findings of this Study.—The findings of this study indicate that in 
the census type of survey, where the housewife is the usual respondent, 
one may expect a negligible variation response for marital status (1.7 
per cent), a higher variation in response for employment status (9.9 per 
cent), and age (12.8 per cent), and the highest for education (23.4 per 
cent). These differences in variability of response arise partly from in- 
herent difficulties in identification of status or classification. This is es- 
pecially true with respect to returns on employment status for persons 
whose labor market attachments are difficult to identify or classify. It is 
also true for information on education for persons whose schooling has 
been informal or is described in terms of the educational systems of 
foreign countries which differ from the American school system. Varia- 
tion in response for age or marital status cannot be explained in these 
terms, however, and is apparently the result of indifferent or inaccurate 
reporting. 

Variability of response is lower for returns from districts in which 
there is a relatively high proportion of native-white residents than for 

2 For a summary of recent studies discussing this problem, see Gladys L. Palmer, The Reliability of 


Response in Labor Market Inquiries, Technical Paper #22, Report to Federal Statistical Agencies, Bureau 
of the Budget, July 1942. 

* The personnel rating was based on all work of the enumerators during the survey and not on the 
results of the revisit program. Ratings were made by supervisors for personal appearance and personal- 
ity, accuracy, learning ability, and output. These were averaged to give grades A, B, C, D, E, and F and 
yield a rough estimate of the comparative ranking of the enumerators’ work. 
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districts in which there is a relatively high proportion of Negroes or 
of Negroes and foreign-born white residents combined. 

None of the other conditions of enumeration which were selected for 
analysis in this study shows consistent relationships to variability of re- 
sponse for all four characteristics, nor are there significant differences in 
the variability of response for returns for men as compared with returns 
for women. Although individual responses in the two field visits show a 
relatively high variability for age and education, the summarized re- 
sults for these items in the two enumerations in terms of frequency dis- 
tributions and averages, are stable. This is not so, however, in the case 
of employment status. 

Providing the enumerators measure up to a certain minimum stand- 
ard, as the enumerators did for this survey, there is little correlation 
beyond that point between the individual’s subjective rating and his 
performance as measured by variability of response. Variability of re- 
sponse is to be expected even under the best enumerating conditions, 
and this may reach surprisingly high proportions, as is indicated by 
the figures quoted above. 

Source of Information.—There were 2686 Philadelphia households 
in the revisit sample, with a count of 8848 persons 16 years of age and 
over in the first enumeration and 8843 in the second.‘ There were 8519 
persons for whom data could be matched for both visits, although all 
of the information analyzed in this study was not reported on both 
visits for each person enumerated twice.’ The comparative data pre- 
sented show differences for those of the total group of 8519 persons for 
whom the items were reported twice. 

Selection of Enumerators.—The 36 enumerators employed in the re- 
visit program were, on the whole, more experienced in schedule inter- 
viewing than the 197 enumerators who made the first visit. Both groups 
of enumerators, however, were given the same training with respect to 
the use of the schedule. Both groups were composed of high school or 
college graduates most of whom had had business, teaching, or social 
work experience. The information called for in the second visit re- 
ferred to conditions as of the date of the first visit, and the second 
enumerators had no information about the household except the ad- 


4 These counts included 324 persons 16 years of age and over who were included in the second 
enumeration but not in the first, and 329 included in the first and not in the second. This difference in 
the count of persons 16 years of age and over arises to some extent from actual changes in household 
composition, residence, or age between the two visits and from differences in the application of the 
“household” concept by the two groups of enumerators, but is primarily the result of inconsistencies in 
the reporting of respondents. 

5 For persons enumerated in both visits, employment status was reported for all persons, marital 
status for all but a few individuals, age for 99 per cent, and education for 95.8 per cent. It may be noted 
that there were more failures to report when a different respondent gave the information on the second 
visit, 
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dress.° An average (median) of 8.7 days elapsed between the two visits, 
and all revisits had been completed within 23 days. 

In 63 per cent of the households, the same person answered the 
questions on both visits of the enumerators. In about three-fifths of 
these cases, the respondent was the housewife. Household heads or 
other relatives were the respondents in the remainder of the cases in 
which the same respondent gave the information on both visits. 


TABLE I 


PER CENT REPORTING DIFFERENT RESPONSE 
BY CITY DISTRICT COMBINATIONS 












































All respondents Same respondent | Different respondent 
City distriet combinations Total Different Total Different Total Different 
Num- | Num- Per |Num-|Num-}; Per | Num-|Num-| Per 
ber | ber | cent ber ber | cent ber ber cent 
Marital status 
Total* 8515 141 1.7 | 5373 87 1.6 | 3142 54 1.7 
Districts 1, 7, 8, 9, 10 4293 58 1.4 | 2800 36 1.3 | 1493 22 1.5 
Districts 2, 3, 4, 5, 6t 4222 83 2.0 | 2573 51 2.0 | 1649 32 1.9 
Age 
Total* 8472 | 1086 | 12.8 | 5349 552 | 10.3 | 3123 534 17.1 
Districts 1, 7, 8, 9, 10 4262 52 12.3 | 27 270 9.7 1476 255 17.3 
Districts 2, 3, 4, 5, 6t 4210 561 | 13.3 | 2563 282 | 11.0 | 1647 279 16.9 
Employment status 
Total* 8519 843 9.9 | 5375 455 8.5 | 3144 388 12.3 
Districts 1, 7, 8, 9, 10 4293 367 8.5 | 2800 204 7.3 | 1493 | 163 | 10.9 
Districts 2, 3, 4, 5, 6t 4226 476 | 11.3 | 2575 251 9.7 1651 225 13.6 
Education 
Total* 8165 | 1911 | 23.4 | 5204 | 1133 | 21.8 | 2961 778 | 26.3 
Districts 1, 7, 8, 9, 10 4092 907 | 22.2 | 2706 552 | 20.4 | 1386 | 355 | 25.6 
Districts 2, 3, 4, 5, 6 4073 | 1004 | 24.7 | 2498 581 | 23.3 | 1575 | 423 | 26.9 

















* Excludes persons not reporting items tabulated. 
+ Districts with 40 per cent or more Negro and/or foreign-born residents. 


Method of Analysis.—The results of a matching of the two sets of 
schedules on a household basis, as distinct from the persons composing 
the households, have already been reported in another publication.’ 
The present analysis is made on the basis of a posting of the two sets of 


* The 2686 addresses for the 1937 revisit program had been selected at random from a larger sample 
of 45,927 households in 149 selected School Blocks scattered throughout the city of Philadelphia, which 
had been used as a sample in a series of surveys of employment and unemployment from 1931 to 1938. 
The purpose of the check was to test the application of schedule concepts by the original enumerators 
as well as to check variability of response. The selection of revisit addresses was such that the work of all 
the original enumerators could be reviewed. 

7 This matching showed a relatively high variability of response for such items as age, employment 
status, occupation and industry, duration of jobs or unemployment periods, and years of residence in 
the city, as compared with such items as race, nativity, relationship to household head, and marital 
status. Margaret W. Beil and Gladys L. Palmer, Employment and Unemployment in Philadelphia in 1936 
and 1937, Part 2, W.P.A. National Research Project in cooperation with the Industrial Research De- 
partment, University of Pennsylvania, 1938, Appendix A. 
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responses for each person in the sample households. The data were 
tabulated by the sex of the person reported, by whether the respondent 
was the same or different, and by city district. The proportion of in- 
dividuals for whom different information was reported on the second as 
compared with the first visit varied only slightly as between districts in 
the city. Variability of response was measured by the degree to which 
the returns for the two visits fell into different class-intervals when 


grouped.® 
TABLE II 


PER CENT REPORTING DIFFERENT RESPONSE BY NUMBER OF 
DAYS IN FIELD AT FIRST VISIT 





























All respondents Same respondent | Different respondent 
Days in field at firet visit Total; Different | Total| Different | Total| Different 
| 
Num-|Num-| Per | Num-|Num-| Per | Num-|Num-| Per 
ber ber | cent | ber ber | cent | ber ber | cent 
Marital status 
Total* 8484 140 1.7 | 5346 86 1.6 | 3138 54 me 
First week 4631 77 1.7 | 2899 44 1.5 | 1732 33 1.9 
Second week and later 3853 63 1.6 | 2447 42 1.7 | 1406 21 1.5 
Age 
Total* 8442 | 1084 | 12.8 | 5323 550 | 10.3 | 3119 | 534 17.1 
First week 4609 579 | 12.5 | 2890 298 | 10.3 | 1719 281 16.3 
Second week and later 3833 505 | 13.2 | 2433 252 | 10.4 | 1400 | 253 8.1 
Employment status 
Total* 8488 835 9.8 | 5348 448 8.4 | 3140 | 387 12.3 
First week 4653 471 | 10.1 | 2901 248 8.6 | 1752 | 223 12.7 
Second week and later 3835 364 9.5 | 2447 200 8.2 | 1388 | 164 1.8 
Education 
Total* 8137 | 1905 | 23.4 | 5180 | 1127 | 21.8 | 2957 77 26.3 
First week 4430 | 1053 | 23.8 | 2799 633 | 22.8 | 1631 420 25.7 
Second week and later 3707 852 | 23.0 | 2381 494 | 20.7 | 1326 | 358 von 




















* Excludes persons not reporting items tabulated. 


Factors in the Variability of Individual Response.—For all items ex- 
cept marital status, variability of response was significantly higher in 
cases in which the respondent was a different person on the two visits. 
The accompanying tables show this distinction. There seem to be no 
significant differences in the variability of response in reports for men 
as compared with women; consequently the data for the sexes have 
been combined in the analysis of differences in individual response. 

Variability of response is found to be higher for districts with 40 per 
cent or more Negro or foreign-born residents or both combined, than 
for districts predominantly composed of native-born white residents 


® The districting of the city follows the pattern of the School District divisions in use in 1937 when 
the original survey was made. All sections of the city were included. 
* The olass-intervals used for frequency distributions will be found in Table IV. 
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(Table I). The difference is not significant for returns for age, but is 
significant for returns for marital status, education, and employment 
status. It may be noted that this relationship holds for the returns for 
all four items when these were given by the same respondent, and for 
all items except age for returns given by a different respondent. 


TABLE III 


PER CENT REPORTING DIFFERENT RESPONSE 
BY DAYS BETWEEN TWO VISITS 








| 









































All respondents Same respondent | Different respondent 
= Total Different Total Different Total Different 
Days between two visits sl 
Num-|Num-| Per |Num-|Num-| Per | Num- Num- | Per 
ber ber | cent | ber ber | cent | ber ber | cent 
Marital status 
Total* 7394 123 1.7 | 4656 76 1.6 | 2738 47 1.7 
1- 7 1544 25 1.6 976 15 1.5 568 10 1.8 
8-14 5376 84 1.6 | 3417 54 1.6 | 1959 30 1.5 
15-23 474 14/ 3.0] 263 7 2.7] 211 7 3.3 
Age 
Total* 7354 944 | 12.8 | 4635 480 10.4 | 2719 464 17.1 
1-7 15338 | 197 12.9] 971| 89 | 9.2| 562 | 108 | 19.2 
8-14 5350 672 | 12.6 | 3402 359 10.6 | 1948 | 313 16.1 
15-23 471 75 115.9] 262 32 | 12.2] 209 43 | 20.6 
Employment status i 
Total* 7395 735 9.9 | 4656 393 8.4 | 2739 342 12.5 
1- 7 1544 150 9.7 976 68 7.0 | 568 2 14.4 
8-14 5377 534 9.9 | 3417 303 8.9 | 1960 231 11.8 
15-23 474 51 | 10.8 263 22 8.4 211 29 13.7 
Education 
Total* 7085 | 1658 | 23.4 | 4509 | 976 | 21.6 | 2576 | 682 | 26.5 
1- 7 1472 338 | 23.0 937 193 20.6 535 145 27.1 
8-14 5159 | 1216 | 23.6 | 3319 728 21.9 | 1840 488 26.5 
15-23 454 104 | 22.9 253 55 21.7 201 | 49 24.4 














* Excludes persons not reporting items tabulated. 


Compared with later returns, the results obtained during the first 
week of the first enumeration show only a slightly higher variability in 
the returns for marital status, employment status, and education, but 
not for age (Table II). It may be inferred that even well trained enumer- 
ators may become careless after the initial training wears off, and the 
effect of this carelessness almost counterbalances the steadying influence 
of experience. Moreover, the greater the lapse of time between the two 
visits, the greater the variability of response on marital status, age, and 
employment status (Table III), but not on education. 

Variability Arising from Rating of Revisit Enumerators.—The revisit 
enumerators were rated subjectively A, B, C, D, E (footnote 3), and 
the accompanying ‘chart was plotted to learn whether there is any 
relation between an individual’s rating and variability of response, as 








- FACTORS IN THE VARIABILITY OF RESPONSE IN ENUMERATIVE STUDIES 149 
VARIATION IN RESPONSE FOR MARITAL STATUS, AGE, 
EMPLOYMENT STATUS, AND EDUCATION 
By Personnel Rating of Revisit Enumerator 


REVISIT ENUMERATOR BY PERSONNEL RATING 
Cc 
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measured by the per cent of responses which differed in the two visits. 
There is a slight indication that A and B revisit enumerators showed 
less variability than those graded D and E. A few revisit enumerators 
showed variability outside the control limits (not drawn), but they are 
found in B ratings as well as D and E. This chart is consistent with the 
statement regarding the enumerators, that beyond a certain standard 
(surpassed by even the E revisit enumerators), there is little relation be- 
tween performance as measured by variability of response and rating. 
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This does not mean that the general quality of schedule reporting is not 
better with experienced than with inexperienced enumerators, but that 
within any given framework of schedule concepts and training and 
qualifications of enumerators, variability of response is to be expected. 

These findings support the evidence of another study in which varia- 
bility of response in the work histories of skilled metal workers in 
Minneapolis was analyzed. In this study, many more factors were con- 
trolled than in a census type of survey, because a homogeneous group of 
workers was interviewed. Nevertheless, it was found that there was no 
consistent relationship between variability of response and the personal- 
ity of interviewers or the lapse of time between the two interviews. More- 
over, there was no consistent relationship between variability of response 
and a worker’s age, education, or his nativity. The major factors which 
appeared to influence variability of response in this study of work his- 
tory information were the simplicity or complexity of the data to be re- 
called and combinations of personal or psychological factors in response 
which could not be measured from available statistical data.!° 

Effects of Variation in Individual Response on the Summarized Results.—- 
Although the degree of variability of response for individuals was rel- 
atively high for age and education, the effects of this are less significant 
when the data from the two enumerations are summarized (Table IV). 
The frequency distributions and averages secured for age and education 
show no significant differences, either for the total, or for men and wom- 
en separately; nor are there significant differences in the proportions 
reporting various marital status categories in the two enumerations. 

With regard to employment status, on the other hand, there are sig- 
nificant differences in the proportions reporting in certain categories in 
the two enumerations, when compared with each other and with the re- 
sults of a combination of the two visits. The second visit yielded a larger 
labor force count than the first visit because it secured a considerably 
higher unemployment count. These differences are significant when re- 
lated to the results of both visits combined. In the case of men, the 
count of the unemployed, and in the case of the women, the counts of 
the labor force, of non-workers, and of the unemployed differ signifi- 
cantly. This finding is corroborated by the results of other comparisons 
of employment status returns for the same individuals." 

The finding that, in general, summarized results are more stable than 
individual responses, is supported by other studies.'? Many inconsist- 


10 Unpublished manuscript by Robert Dinkel in the files of the National Research Project of the 


Works Projects Administration, 

11 Palmer, op. cit., footnote 2. 

12 Read Bain, “Stability in Questionnaire Response,” American Journal of Sociology, November 
1931, pp. 445-453; Ruth S. Cavan, “The Questionnaire in a Sociological Research Project,’’ American 
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TABLE IV 


MARITAL STATUS, AGE, EMPLOYMENT STATUS, AND EDUCATION BY 
SEX AS REPORTED IN THE FIRST AND SECOND ENUMERATIONS 





























































































































| First enumeration Second enumeration 
Total Men Women Total Men Women 
Num-| Per | Num-| Per | Num-} Per |} Num-| Per | Num-| Per | Num-| Per 
ber |cent| ber | cent| ber | cent|| ber | cent| ber | cent! ber | cent 
Marital Status 
Total* 8846 100.0) 4449 100.0! 4397 100.0) 8822 |100..0| 4431 100.0] 4391 |100.0 
Single 3184 36.0) 1775 39.9) 1409 | 32.1|| 3213 | 36.4! 1793 40.5) 1420 | 32.3 
Married 4631 | 52.3) 2335 | 52.5) 2296 52.2 4535 51.4) 2279 51.4) 2256 | 51.4 
Othert 1031 11.7| 339 7.6) 692 15.7) 1074 12.2) 359 8.1) 715 16.3 
Age 

Total* 8834 |100.0) 4450 {100.0} 4384 100.0) 8782 |100.0) 4419 |100.0) 4363 |100.0 

16-19 970 | 11.0} 465 | 10.4) 605 | 11.5 987 | 11.3} 471 | 10.7); 516 / 11.8 
20-24 1241 | 14.0] 648 | 14.6) 593 | 13.5|| 1227 | 14.0) 648 | 14.7) 579 | 13.3 
25-29 1022 | 11.6] 501 | 11.3) 521 | 11.9]| 1011 | 11.5) 479 | 10.8) 532 | 12.2 
30-34 790 8.9| 417 9.4) 373 8.5 761 8.7) 401 9.1; 360 8.2 
35-39 818 9.3) 389 8.7) 429 9.8|| 827 9.4) 412 9.3) 415 9.5 
40-44 834 9.4) 422 9.5) 412 9.4 819 9.3) 395 8.9| 424 9.7 
45-49 785 8.9) 402 9.0) 383 8.8 782 8.9) 393 8.9} 389 8.9 
50-54 739 8.4) 381 8.6) 358 8.2)| 732 8.3) 385 8.7| 347 8.0 
55-59 545 6.2} 289 6.5) 256 5.8}; 530 6.0| 287 6.5) 243 5.6 
60-64 451 5.1; 210 4.7| 241 5.5|| 463 5.3} 223 5.0} 240 5.5 
65 and over 689 7.2| 326 7.3) 313 7.1 643 7.3} 325 7.4| 318 7.3 

Median age 37.4 37.5 37.3 37.5) 37.6 37.3 

Employment Status 
Total* 8519 !100..0 4266 |100.0) 4253 100.0 8519 {100.01 4266 hoo..o 4253 |100.0 
Labor force 5444 63.9) 3858 | 90.4) 1586 37.3) 5569 65.4| 3891 | 91.2) 1678 | 39.5 
Employed 3312 | 38.9] 2317 | 54.3) 995 23.4) 3222 37.8) 2249 | 83.7) 973 | 22.9 
Unemployed 1797 | 21.1) 1268 | 29.7} 529 | 12.4|| 2017 | 23.7) 1374 32.2) 643 | 15.1 
Emergency 335 | 3.9] 273] 6.4] 62] 1.5]| 330] 3. 9| 268 | 6.3/ 62] 1.5 
Not seeking work | 3075 36.1) 408 6.6) 2667 62.7) 2950 34.6) 375 8.8) 2575 | 60.5 
Education 
! 

Total* 8442 100.0! 4218 100.0! 4224 100.0| 8207 100.0! 4069 00.0] 4138 |100.0 
0 483 5.7) 215 5.1) 268 6.4) 470 5.7) 194 4.8) 276 6.7 
1- 4 747 8.9| 374 8.9| 373 8.8 714 8.7| 370 9.1) 344 8.3 
5- 8 4206 | 49.8] 2129 | 50.5) 2077 os 4180 | 51.0) 2121 | 52.1) 2059 | 49.8 
9-12 2651 | 31.4) 1284 | 30.4} 1367 | 32.4|| 2501 | 30.5) 1176 28.9) 1325 | 32.0 
13-16 318 3.8) 185 4.4) 132 3.1 290 3.5) 164 4.0) 126 3.0 
17 and over 37 | 0.4) 31] 0.7 6) 0.1) 2; 0.6) 44] 1.1) 8| 0.2 

Median 7.8 7.9 7.8) 7.8 7.8) 7.8 






































* Excludes persons not reporting items tabulated. 
t Widowed, separated, divorced. 
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encies in reporting are inevitably compensating in character when the 
data are combined into frequency distributions. The extent to which 
variability in individual responses affects cross-classifications, such as 
employment status by age and sex or age and sex by education, has not 
been explored here but the chances of variability in the summarized re- 
sults of such cross-tabulations are obviously greater. 

It is clear from this study that a complete census of any group in the 
population is subject to some error or variability in reporting, regardless 
of the perfection of schedule concepts and definitions or the training of 
enumerators. It is not necessary to emphasize that variability of re- 
sponse affects the analysis of all population data, and is also important 
in the development of sampling techniques for enumerative studies. The 
choice of economic or social controls for stratification of sampling and 
sub-sampling units must be determined, in part, by the relative relia- 
bility of the information obtained in the survey used for controls. 





Journal of Sociology, March 1933, pp. 721-727; Alberta Curtis, “The Reliability of a Report on Listen- 
ing Habits,” Journal of Applied Psychology, Volume 23, 1939, pp. 127-130; Mapheus Smith, “A Note on 
Stability in Questionnaire Response,” American Journal of Sociology, March 1933, pp. 713-727. 








TIME SERIES SIGNIFICANCE TESTS BASED ON 
SIGNS OF DIFFERENCES* 


By Grorrrey H. Moore 
National Bureau of Economic Research 
AND W. ALLEN WALLIS 
Stanford University 


ECENTLY we proposed a significance test, appropriate to time series, 
based on signs of first differences in either the observations or 
some derivative sequence.' The test is made by tabulating the fre- 
quency distribution of phase durations—in other words the frequency 
distribution of runs in signs of first differences—and comparing it with 





aie tie 5(N —3) 
the distribution expected if the series were random, — , 
11(N —4) 4N-21_ . 
—————» am “a being the expected frequencies for phases of 


60 
durations one, two, and over two, respectively. Since this expected 
distribution is the same for all continuous populations, such a test has 
a high degree of generality. The ideal statistic for the comparison is not 
known, but for practical purposes it suffices to calculate an ordinary 
x° for goodness of fit (denoting it by x,*) and refer 6/7 of its value to 
the usual x? table for two degrees of freedom when x,?<6.3, or refer 
its whole value to a special tabulation of x? for 23 degrees of freedom 
when x,’?>6.3. 

A variation upon the x,’ test is to tabulate separate frequency distri- 
butions of durations of “expansions” (runs up) and “contractions” (runs 
down). Each distribution may be compared with an expected distribu- 
tion having frequencies half as large as those expected for the combined 
distribution, or a test of homogeneity may be applied to the two distri- 
butions. Theoretical difficulties attach to either alternative, however.? 

An alternative (but not independent) test can be made by simply 
counting the number of phases, i.e., the number of runs in signs of dif- 
ferences.’ This is equivalent to counting the number of turning points, 
i.e., reversals in direction of movement. Unless N, the number of ob- 


* This paper is a byproduct of work at the National Bureau of Economic Research, 1939-40, under 
Research Associateships provided by the Carnegie Corporation of New York. 

1W. Allen Wallis and Geoffrey H. Moore, A Significance Test for Time Series and Other Ordered 
Observations. National Bureau of Economic Research Technical Paper 1 (September, 1941), 59 pp. Fora 
brief summary of this monograph, see W. Allen Wallis and Geoffrey H. Moore, “A Significance Test for 
Time Series Analysis,” this JouRNAL, 36 (1941), pp. 401-9. 

2 See the Technical Paper, op. cit., pp. 40-41. In addition to the difficulties there mentioned, it 
seems doubtful that the yp?distribution is appropriate to chi-squares derived from separate distributions 
of expansions and contractions. 

3 Op. cit., Sec. VII. 
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servations, is small (in which case tabulations of the exact distributions 
are available), such counts are normally distributed with variance of 
16N —29 2N —4 
“— and mean of 


2N-—7 


for the number of turning points or 





for the number of phases. 





The present paper discusses three additional tests based on signs of 
differences: 

(1) The total number of signs of each kind may be counted. The exact 
distributions are tabulated in Table I for N <12, and for longer series 
the number of signs of a given kind is normally distributed about 


N-1 . : N+1 
with variance of . 








(2) The regularity of a periodic pattern, as judged by the tendency 
of the signs of one kind to cluster together in the pattern, can be tested, 
making allowance for any excess of signs of one kind. The exact proba- 
bilities can be calculated from a recursion formula, but have been tabu- 
lated only for the particular case of a nine point pattern. 

(3) The existence of correlation between time series, abstracting from 
the effect of their trends, may be studied by making a 2 X2 table of the 
signs of differences in the two series. The method may also be extended 
to more than two series. Although such a test would be of great value, 
the sampling distribution is known only for the special case of ran- 
domly arranged first differences. 

In conclusion, a few general considerations relating to tests based on 
signs of differences are discussed. 

I 

A quick test of the randomness of a series can be made by counting 
the plus and minus signs in the first differences. A series of N observa- 
tions has N —1 differences, and if the observations are in random order 


the expected number of plus or minus signs is obviously es She 


1 
» and the distri- 





variance of the number of signs of one kind is 


bution rapidly approaches normality as N increases. 

These results were reached by working with exact distributions for 
N <12, the exact distributions being calculated from the recursion rela- 
tion 


on(m) = (N — m)on-i(m — 1) + (m + 1)goy-a(m), 


where ¢y(m) represents the number of permutations of N different 
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numbers that produce exactly m positive (or m negative) differences.‘ 
For a series of values of N the variances were found to have constant 
first differences of 1/12; since when N =2 the variance is }, the general 


, : . N+ , 
expression for the variance is ~ 9 .. The third moment about the mean 





is zero for all values of N, since the distributions are symmetrical. A 
series of values of the fourth moment about the mean shows, when 
N 24, constant second differences of 1/24; since when N =4 the fourth 
moment about the mean is 23/48 and the first difference 53/240, the 
general expression for the fourth moment about the mean is 
5N?+8N+3 

240 

As N increases, the kurtosis, measured by the ratio of the fourth 
moment about the mean to the squared variance, approaches 3, the 
value for a normal distribution. For all values of N the skewness is, of 
course, zero, as it is for a normal distribution. Comparison of the exact 
distribution for N=12 with that based on a normal approximation 
shows fair agreement; so for N >12 the normal distribution apparently 
provides a satisfactory approximation. When using a normal ap- 
proximation the Laplace correction for continuity, which consists of 
subtracting 3 from the discrepancy between observed and expected 
frequencies, should be applied. (A formula for calculating the normal 
deviate is given in the note to Table I.) For N $12 the exact distribu- 
tions are shown in Table 1. 

A test of the number of signs of one kind may serve as a test of cor- 
relation between two variables. The paired values are arranged in 
order according to one variable, and the signs of differences are tabu- 
lated for the other variables. Such a test is perhaps most appropriate 
for the detection of trend—the correlation of a series with time—for 
here no question arises as to which of the variables is to be arranged in 
order. In other cases the choice may not be obvious, and the result 
may depend upon which variable is selected. 

‘ This relation was derived empirically from equation 4, page 19, of the Technical Paper. As is 
pointed out there (pages 5 and 11), a distribution of signs of differences is exactly the same for random 
arrangements of N different numbers as for samples of N drawn from any continuous population. 

After this paper was in proof we discovered that exactly the same recursion relation given above was 
given in a different context by Major P. A. MacMahon, “Second Memoir on the Compositions of 
Numbers,” Philosophical Transactions of the Royal Society of London, 207A (1908), pp. 65-134. The re- 


cursion relation given above appears in Art. 30, p. 83. At the end of Art. 33, p. 81 of the same memoir, 
MacMahon gives an explicit formula which, in our notation, may be written 


~ {N+1 ei 
$n(m)=) (-1)' »— M(m+1—i)". 
i=0 . 
He also tabulates ¢y(m) through N =6 on page 81. 


‘ This expression may also be deduced easily from the familiar theorem that the variance of the 
sum of k variates is the sum of the & variances plus twice the sum of the [k(k —1)]/2 covariances. 
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As an illustration, this test of the sign counts has been applied to 
sweet potato production, yield per acre, and acreage harvested in the 
United States,® 1868-1937. The production series shows 37 rises and 32 
declines. Since N =70 (observe that N is the number of observations, 
not differences), the expected number of rises or declines is 34.5, which 
differs from the observed number by 2.5, or 2.0 when corrected for 
continuity. The variance is 71/12, or the standard deviation 2.432. 
Taking 2.0/2.432 =0.822 as a normal deviate, we find a probability of 
0.41 for so large a departure fro:n expectation. Hence there is no evi- 
dence of trend. For acreage, the probability is about 0.0000000001, 
clearly indicating the presence of trend.’ Actually, the charts of the two 
series suggest that approximately the same degree of trend is present in 
both series. The production series, however, contains random move- 
ments whose year-to-year variations are much greater than the yearly 
trend effects; whereas in the acreage series, particularly in the earlier 
years, the random variations are small relative to the trend. Any test of 
significance judges systematic variations by the relation of their mag- 
nitude to that of random variations; in this case the consequence is de- 
tecting the trend in the more stable acreage series and missing a similar 
trend in the more erratic production series. (This point is discussed 
further in the final section.) The yield series shows 35.5 rises and 33.5 
falls (counting the one year of no change as half a unit in each cate- 
gory). Since as great a difference would result 84 per cent of the time 
from random causes alone, there is no evidence of trend; and this ac- 
cords with the appearance of the chart. 

It should be emphasized that the null hypothesis is that the observa- 
tions are in random order, not that the first differences are random. To 
illustrate the distinction, suppose that of the six differences in a series 
of seven observations, 5 are of one sign and 1 of the other. On the null 
hypothesis that the seven numbers are in random order, we find the 
exact probability of a division as unequal as five-to-one to be 0.048 
(Table I), so should probably conclude that the discrepancy is sig- 
nificant. Had we, on the other hand, formulated the null hypothesis 
that the differences are random and equally likely to be positive or 
negative we should have used a binomial distribution and found a 
probability of 0.219, more than 43 times as great and definitely not 
significant. The variance on the null hypothesis that the differences are 


6 The original data (taken from Agricultural Statistics, 1939, p. 243) and a chart are shown in the 
Technical Paper, pp. 37-39. 

7 Similar results were obtained in the Technical Paper, pp. 40-41, by testing separate frequency 
distributions of expansion and contraction phases. The number of signs of one kind is of course equiva- 
lent to the total duration of phases of one kind, hence is related to their frequency distribution by dura- 
tion. The test of sign count or total duration, however, is free from the theoretical difficulties that attach 
to a test of the frequency distributions. 
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random and equally likely to be positive or negative is (VN —1)/4, and 
therefore (when N >2) exceeds the variance on the null hypothesis that 
the observations are random; as N increases, the ratio between the two 
variances approaches 3. Hence the first hypothesis yields larger proba- 
bilities for given departures from expectation. 

In economic research it is not always easy, but it is of course essen- 
tial, to determine which of the two null hypotheses is appropriate. In 
testing the randomness of changes in stocks of commodities, for exam- 
ple, it might be appropriate to use a binomial distribution in connection 
with signs of first differences of stocks; but in testing whether stocks 
themselves constitute a random series, the appropriate distribution is 
the one given here. It may be observed that an inequality in the sign 
counts which cannot be accounted for on the assumption of random 
differences certainly cannot be accounted for on the assumption of ran- 
dom observations; and an inequality that can be explained by the 
hypothesis of random observations can be still more easily explained 
by the hypothesis of random differences. 


II 


A second kind of problem for which the signs of differences provide 
an appropriate significance test arises in periodogram analysis and 
similar procedures. Suppose a series has been divided into equal seg- 
ments, or segments derived according to some external criterion, e.g., 
the turning dates of cycles in general business. It may be desired to 
test whether the movements within such segments tend to be smooth 
and regular rather than zig-zags, since regularity can normally be taken 
as indirect evidence that the period used is reflected in the data. Per- 
haps a description of the particular situation that resulted in this appli- 
cation will clarify the problem. 

One feature of the method of studying cyclical behavior devised by 
the National Bureau of Economic Research involves dividing time 
series into segments corresponding with the dates of so-called reference 
cycles, i.e., cycles in general economic activity, each segment being 
bounded by two consecutive reference troughs.* Within each reference 
cycle, the series is expressed in reference cycle relatives; that is, each 
observation is divided by the average value of the series during that 
reference cycle. Each reference cycle is then divided into nine stages, 
and the mean of the relatives within each stage is computed. The first 
and ninth stages each consist of three months centered on the reference 
troughs and the fifth stage consists of three months centered on the 


8 Cf. Wesley C. Mitchell and Arthur F. Burns, “The National Bureau's Measures of Cyclical Be- 
havior,” National Bureau of Economic Research Bulletin 57 (July 1, 1935), 20 pp.; or better, their forth- 
coming volume, Methods of Measuring Cyclical Behavior. 
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reference peak; stages II, III, and IV correspond with successive thirds 
of the reference expansion, stages VI, VII, and VIII with successive 
thirds of the reference contraction. The nine means thus depict the 
pattern of behavior during a reference cycle. 

Now if the series being analyzed contains cycles, any one reference 
pattern will be fairly regular, though its peaks and troughs need not 
conform to the reference dates. But an average reference cycle pattern 
made by averaging the reference stage standings over a number of ref- 
erence cycles will tend to be simply a random sequence, unless the refer- 
ence dates bear some relation to the behavior of the series—unless, in 
other words, the series conforms with general business activity. 

The nine points of an average reference cycle pattern are rather few 
for a satisfactory test of the distribution of phase durations or of the 
total number of reversals. Furthermore, both these tests or one based 
on the total sign count would reflect the intra-cycle trend, which is re- 
tained in the reference cycle patterns. But from our recursion formula® 
it is possible to answer the question: Given the number of plus and mi- 
nus signs in the average pattern, do they tend to cluster more than 
would be expected if they had occurred in a set of nine random stand- 
ings? 

To answer this question, consider the integers 1 to 8 arranged around 
a circle, with the signs of their first differences entered between them. 
The circular arrangement allows for the fact that stage IX in one cycle 
is based on the same data as stage I in the next, and eliminates the 
arbitrariness of taking up the cycle at any particular stage. Since 8 is 
the largest of the integers, the difference to one side of it must be posi- 
tive and to the other side negative. From the recursion formula, the 
probability of each of the 64 arrangements of the other six signs that 
can occur among the 5,040 permutations of the digits 1 to 7 can be com- 
puted. Actually, we need consider only those of the 64 arrangements of 
signs that produce the observed count of pluses and minuses; if there 
are m minus signs (1<m37), there are as many of these as there are 
combinations of six things taken m—1 at a time. The relevant arrange- 
ments of signs are then grouped according to the amount of clustering, 
as in Table II. The probability of as much clustering as is shown by 
each possible grouping is obtained by summing the probabilities of se- 
quences of signs corresponding with each group. 

This test is in a sense an amalgam of the test of phase durations and 
the test of turning points. Except for the circular arrangement, it is like 
the test of turning points in that the fewer the turning points the 
greater the clustering. But for a given number of turning points, the de- 


® Technical Paper, Equation 4, p. 19. 
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TABLE II 
EXACT PROBABILITY OF AS MUCH (OR MORE) REGULARITY OF PATTERN FROM 
EIGHT OBSERVATIONS ARRANGED AT RANDOM IN A CLOSED SEQUENCE AS 
IS SHOWN BY EACH POSSIBLE GROUPING OF SIGNS OF DIFFERENCES 














4:4 Ratio between signs 5:3 Ratio between signs 6:2 Ratio between signs 
Grouping* Probability Grouping* Probability Grouping* Probability 
2 -0083 : -0126 ; -0500 
- .0993 -e 1453 + - 2667 
te 2368 + 3955 oo .7167 
ie .2914 05 .6448 a 1.0000 
pete EL 

EEE 00 





* Entries under “‘grouping” indicate in the first line the lengths of runs in one sign, in the second 
line the lengths of runs in the other sign. Thus a indicates that the pluses (minuses) are contained 
in one group of 3 and one group of 1 and the eteenee (pluses) in two groups of 2 each. 
gree of clustering is greater when the signs of one kind fall mostly in a 
single group (phase) than when they are more evenly distributed among 
the groups. 
III 

The test of cyclical conformity just described is based simply upon 
the regularity of the average pattern. Some types of systematic move- 
ment, e.g., seasonal, do not imply this type of regularity—indeed, it is 
possible that a series might follow business cycles systematically, but 
in an irregular fashion. This suggests the need for another kind of test, 
one to determine, for example, whether movements during correspond- 
ing stages of successive reference cycles are consistent. Such a test 
would be one of a general class of correlation tests based on signs of first 
differences. 

Suppose it is desired to test whether there is a relation between the 
nine-point patterns of two reference cycles, the seasonal patterns given 
by twelve monthly figures for each of two years, or any two time series 
of equal length. A 2X2 table can be constructed by listing the signs of 
differences for each series and tabulating the number of instances in 
which (a) both series rise, (b) the first rises and the second falls, (c) the 
first falls and the second rises, and (d) both fall. A homogeneity test of 
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this double dichotomy, treating the marginal totals as fixed, would 
have the advantages of simplicity, generality, and elimination of at 
least the primary effects of trend. 

The usual procedures for analyzing double dichotomies are based on 
the assumption that the entries are random and independent. If each 
series is a random sequence, the differences are negatively correlated 
serially, and this means that a chance coincidence in direction of move- 
ment during one interval tends to produce another coincidence during 
the next interval. Furthermore, suppose the first series has an excess of 
positive differences; these are more likely to be located at certain parts 
of the series than at others.'® If the second series also has more plus 
than minus signs they tend to fall in the same positions, thus producing 
coincidences even though the series are unrelated; but if the second se- 
ries has more minus than plus signs, the minuses tend to fall where the 
pluses do for the first series, thus producing a spurious appearance of a 
negative relation between the two series. 

A few tabulations that we have made of the exact distribution on the 
assumption that the observations are random suggest that the usual 


10 This may seem surprising since each of the N —1 differences in a random series is equally likely 
to be positive or negative. When it is known, however, that one sign has occurred less frequently than 
the other, the occurrences of the less frequent sign are more likely to be at the center than at the ex- 
tremes of the series. This may be verified by the recursion formula referred to in footnote 9, but the 
following example illustrates the point: Suppose that of the N —1 differences in a series of N, exactly 
one is negative. If this negative difference is the mth in the series of N —1 differences, it means that (1) 
the first m observations are arranged in ascending order, (2) the last N —m observations are arranged 
in ascending order, and (3) the greatest observation among the first m exceeds the least among the last 
N-—m. Any partition of the N observations into two groups, of m and N —m respectively, provides for 
one (and only one) fulfillment of the first two conditions; so there are 

N! 


m!(N—m)! 
ways to satisfy conditions 1 and 2. Of these, all but one—that in which each observation among the 
N-—m exceeds each among the m—also satisfy the third requirement. Hence there are among the 
permutations of N different numbers exactly 
NI 
—_———_ -l 
m!(N —m)! 
which have all differences positive except the mth. This quantity summed over the range m=1 to 
m=N —1, or 2 —N —1, is the number of permutations of N different quantities that produce exactly 
one negative sign among the signs of differences. So if a random sequence of N numbers has produced 
only one negative first difference (the same reasoning applies for one positive difference), the probability 
that it is the mth difference is ‘ . 
NI —m!(N —m)! 





m!I(N — m)!(2N — N —1) 

which has a maximum at m =N/2 when N is even or maxima at m =(N —1)/2 and m =(N+1)/2 when 
N is odd, and minima at m =1 and m =N —1. When there is more than one minus sign, but fewer than 
(N —1)/2, the situation is more complex. For example, two minus signs if adjacent are most likely to be 
in the center but if non-adjacent are most likely to be about one-third of the way from each end; they 
are more likely to be non-adjacent than to be adjacent and the net result appears to be a distribution 
with three modes, one at the center and the other two a Little more than a third of the way from each end. 
As the number of minus signs rises toward (N —1)/2 the probability of a minus in any given position 
becomes more nearly uniform for all positions. 
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treatment of double dichotomies, though based on the assumption of 
random differences, may provide usable approximations unless WN is 
very small or the number of plus and minus signs within either series 
is extremely unequal. When N =8 and each series has the same number 
of signs of a given kind, the comparison is as shown in Table III. 


TABLE III 


PROBABILITIES OF VARIOUS NUMBERS OF COINCIDENCES IN POSITION FOR SIGNS 
OF DIFFERENCES IN TWO RANDOM ARRANGEMENTS OF EIGHT OBSERVATIONS, 
EACH CONTAINING m POSITIVE (OR m NEGATIVE) DIFFERENCES; AND 
COMPARISON WITH APPROXIMATION BASED ON ASSUMPTION 
THAT DIFFERENCES ARE RANDOM AND INDEPENDENT 








Probability 








m* Coincidencest 
Exact Approximatet 
1 1 0.202708 0.142857 
0 0.797292 0.857143 
2 0.070409 0.047619 
2 1 0.470286 0.476190 
0 0.459305 0.476190 
3 0.042049 0.028571 
3 2 0.342878 0.342857 
1 0.480747 0.514286 
0 0.134325 0.114286 





* m is the frequency of the less frequent sign. 

t Number of coincidences in position for the less frequent sign. The table also covers pairs of series 
in which one has m signs of one kind and the other has m signs of the opposite kind, if coincidences are 
counted for the two sets of m; in such cases inverse relationship is tested. 

t Calculated by the method given in R. A. Fisher, Statistical Methods for Research Workers, Fifth 
aad Later Editions (London and Edinburgh, 1934-40), Section 21.02. 


It may be remarked that if the two dichotomies appear to be homo- 
geneous when treated by the usual procedure, which implicitly assumes 
random differences, they surely would appear so if the proper null 
hypothesis of random observations were used; but nothing more can 
be said definitely until the sampling distribution appropriate to the as- 
sumption of random observations is known. 

It is easy to extend this method of testing correlation to more than 
two series. For example, if it is desired to test the consistency of refer- 
ence cycle behavior over several reference cycles, the signs of the differ- 
ences for the eight stage-to-stage movements in the separate reference 
cycle patterns may be entered in a table of eight columns and as many 
rows as there are cycles. By counting the number of plus signs in each 
column, and comparing the counts with those expected if the total num- 
ber of plus signs in the entire table were equally divided among the 
columns, we can see whether the movements during each interval tend 
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to be consistent from cycle to cycle, and at the same time allow for the 
effect of trend on the total number of plus signs. In testing the sig- 
nificance of this tendency, however, difficulties with respect to the null 
hypothesis again rise. If the differences can be assumed random sequen- 


P P 
tially, the variance of the column counts is ra (1 =) (where R is the 


number of rows, C the number of columns, and P the number of pluses 
in the table), and the ratio to this of the sum of squared differences be- 
tween observed and expected column counts is distributed as x? with 
C—1 degrees of freedom, provided R is not too small and the proportion 
of plus signs is not too far from one-half (when R is small or the number 
of pluses and minuses extremely unequal, exact distributions may be 
computed without great difficulty). If the appropriate null hypothesis 
is that the observations are sequentially random, we know of no valid 
procedure except the laborious one of tabulating exact distributions, 
unless the procedure assuming randomly arranged differences turns out 
to be satisfactory also for randomly arranged observations. 

The desirability of a solution to this problem is enhanced by the pos- 
sibility of using the same criterion, signs of differences, to test the ran- 
domness of the separate series. On the other hand, the technique would 
be restricted in scope by the fact that in many time series problems 
neither the hypothesis of sequential randomness in the observations 
nor that of sequential randomness in the differences is tenable. For ex- 
ample, in testing the correlation between two economic series we should 
allow for the presence of cyclical characteristics which tend to dupli- 
cate, or to cancel, any chance coincidences. (In these cases, methods 
suggested earlier may be preferable—arranging the pairs in order ac- 
cording to one variate, and counting the number of signs of one kind, 
the number of reversals, or the number of phases of each duration in 
the other variate.) 


IV 


The generality and simplicity of tests based on signs of differences is 
due to the fact that they utilize only information that is inherent in 
every sample. When there exists relevant information other than that 
contained in the sample, as for example the knowledge that the ob- 
servations are from a normal or approximately normal population, the 
tests are “inefficient” in the sense that they fail to utilize the supple- 
mentary information. Where no such additional knowledge exists, how- 
ever, they are more effective than tests of narrower scope, in that they 
do extract some information from samples to which the less general 
tests cannot be applied at all. Furthermore, even when some supple- 
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mentary information is available, a test based on signs of differences 
may be quite efficient. For example, in certain types of data the signs 
of differences are more accurate than the magnitudes of either the ob- 
servations or the differences."' Again, the problem studied may be such 
that direction of movement per se is significant and the magnitude of 
movements is irrelevant. A similar situation arises when it is desired to 
abstract from the effects of trend, for a test based on signs of differences 
is relatively insensitive to movements that are gradual in relation to the 
other movements in a series. 

The greater the random movements in a series, the more likely are 
tests based on signs of differences to overlook systematic movements. 
This is true of any significance test, for all significance tests judge the 
systematic tendencies by comparison with random fluctuations. The 
weakness of tests based on signs of differences is, however, that the ran- 
dom variation ordinarily is reduced only slightly by increasing the 
amount of data; that is, the “averaging out” process is not very effec- 
tive. It is possible to conceive of cases where increasing the length of a 
series would be highly effective (for example, in data having a trend rise 
of ten units per year, if the trend-adjusted data fall within a range of 
one unit), and also of cases where it would have absolutely no effect 
(for example, in data having a trend rise of one unit per year, if the 
trend-adjusted data never differ by less than ten units). The series on 
sweet potato acreage discussed previously lies toward the first, that on 
sweet potato production toward the second, of these extremes. 

No general statement about the efficiency or inefficiency of tests 
based on signs of differences can be valid except in the particular con- 
text of a specific problem and body of data. 

11 With economic measures of the kind for which index numbers are ordinarily used, it may be cer- 
tain that a change has been in a given direction (e.g., when all components of the index change in the 
same way), questionable how much the change has been (because of ambiguities in the weighting sys- 


tem), and meaningless to state an absolute standing. Similarly, the short-time directions of change 
of an index may be more meaningful than the long-time directions of change. 








AN APPROACH TO THE MEASUREMENT OF FARM 
POPULATION PRESSURE IN WISCONSIN 


By Tuomas C. McCormick 
University of Wisconsin 


HE CONCEPT of an optimum population is actually so basic in cur- 
co population theory that without it the latter would have little 
meaning or motive. It is, however, still essentially a philosophical ab- 
straction that is sometimes discounted or even denied. In spite of its 
important social consequences, the idea has never been the subject of 
serious quantitative testing and study in concrete situations. The 
present paper reports the results of a quantitative approach to one 
aspect of the problem recently made in Wisconsin, which may be of 
interest as a methodological venture. 

From the agricultural point of view, an optimum population implies 
that the per capita income of a farm population is in part a function of 
the ratio of the size of the population to the amount of land farmed. 
Accordingly, if interfering factors can be controlled, variations in the 
acreage of farm land per capita of the farm population from county! to 
county should be accompanied by differences in the per capita farm 
income, which should range from a minimum to a maximum. 

Only data published by the state and federal governments were 
available. Since as a rule such large-scale data can come only from 
government sources, this was regarded as one of the conditions of the 
problem. 

The search for an index of the income of the farm population was 
able to discover nothing better than a detailed and careful estimate of 
gross farm income by counties made by the Wisconsin Department of 
Agriculture and Markets in 1931 (no later estimates have been pre- 
pared). These county estimates were deflated and reduced to a per 
capita basis of the rural farm populations? in 1930. Although the index 
is not without faults, it was found to be rather highly correlated* with 
other indexes of agricultural prosperity, such as the value of crops and 
livestock products per capita of the rural-farm population in 1930 
(r?=.81), the percentage of farm homes with automobile, radio, and 
telephone (r?=.62), and the per capital value of farmers’ dwellings 


1 The county was the smallest unit for which data were published. Data were lacking for a time 
series analysis. 

2 Wisconsin Agriculture, Bulletin No. 140, p. 45, 1932; and 1930 Census of the United States, Popu- 
lation, Vol. III, Pt. 2, p. 1328. The gross farm income is the estimated value of all products grown. This 
was divided by 146 and multiplied by 100 to give the deflated index. The index of prices paid by farmers 
for commodities bought in 1930 is 146, with 1910-14 as the base period. See Wisconsin Agriculture, 
op. cit., p. 10, column 20. 

3 Computed from a test sample. 
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(r? =.64). This index, because it represents the dependent variable, is 
represented by the symbol Xo, and will be referred to as “per capita 
income.” The independent variable, the acreage of farm land per 
capita of the rural-farm population, was of course readily obtained for 
each county by dividing the total farm acreage by the rural-farm popu- 
lation‘ in 1930. It has the symbol Xi, and will be called the “land-man 
ratio.” In studying the relationship between the per capita income 
(Xo) and the land-man ratio (X;), the chief difficulty lay in the control 
of interfering factors. As usual in social research, this problem was only 
partially solved by holding constant farm land values per acre’ (X;) 
and type of farming® (X;3). 


GENERAL METHOD AND RESULTS 


The data and method of analysis are shown in Tables I through VI. 
Omitting the two highly urbanized counties of Milwaukee and Keno- 
sha, the remaining sixty-nine counties in Wisconsin are divided into 
three broad types of farming areas, each area occupying two tables to 
itself. In Tables I, III, and V the counties in each area are classified 
first by mean value of farm land per acre (X2), and second by land-man 
ratio (X,). In any land-value class (row) in an area table (Tables I, III, 
and V), both type of farming and land value per acre are held roughly 
constant, so that variations in mean per capita income (Xo) from one 
land-man ratio class (column) to another can probably be attributed to 
differences in the land-man ratio (X,) and (unknown) associated fac- 
tors. It can be seen by inspection that in every row (of the odd-num- 
bered tables) the mean per capita income (Xo) increases as the land- 
man ratio (X;) increases. The actual amount of this relationship, with 
land value (X_2) and type of farming (X3) constant, is roughly measured 
by the squared partial correlation ratio, 701.23, calculated for each row 
and averaged over the table. The 11.23? values for Tables I, III, and V 
are .63, .68, and .49, respectively. 

On the strength of the relationship just demonstrated between per 
capita income (Xo) and the land-man ratio (X;), it is assumed that 
those counties that have the highest land-man ratio in a given row rep- 
resent, from the point of view of per capita farm income, the most 
favorable adjustment of farm population to farm land in their class; 
and the mean land-man ratio of these “top” counties is taken as the 
criterion for adjusting upward the land-man ratios of the other counties 

4 1930 Federal Census, Agriculture, Vol. II, Pt. 1, p. 736. 
5 1930 Federal Census, Agriculture, Vol. II, Pt. 1, pp. 736, 748. Deflated by dividing value per acre 


by 115 and multiplying by 100. The index of farm real estate values in 1930 is 115, with 1910-14 as 
base period. See Wisconsin Agriculture, op. cit., p. 10, column 22. 

* Type of farming areas were taken with modifications from P. E. McNall and W. J. Roth, Forces 
Affecting Wisconsin Agriculture, with Resulting Types of Farming, Research Bulletin 131, Agricultural 
Experiment Station of the University of Wisconsin, November, 1935. 
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in the same class (row). The farm population required to raise the land- 

man ratio of a county to that of the criterion counties in the same row 

is found by dividing the farm acreage of the county in question by the 

mean land-man ratio of the criterion counties.’ This is called the “ad- 

justed” farm population. A county index of relative population pressure 
TABLE I 


TYPE OF FARMING AREA I, TWENTY-TWO WISCONSIN COUNTIES: 
METHOD OF ANALYSIS* 



































Xs xX 18-23 24-29 30-33 Totals 
n 3 0 2 5 
Xo 214.3 246.5 227.2 
30-44 rXe 643 493 1136 
rXe 143 ,605 121,549 265,154 
(2X0)*/n 137 ,816 121,525 258,099 
n ~ 4 1 13 
Xo 214.9 303.0 328.0 250.7 
45-59 rXe 1,719 1,212 328 3,259 
EX? 376,641 369 , 882 107 , 584 854,107 
(2X0)*/n 369,370 367 ,236 107 ,584 817 ,006 
n 4 0 0 4 
Xe 252.8 252.8 
60-84 rXe 1,011 1,011 
rXet 256 ,677 256 ,677 
(2Xe)2/n 255,530 255,530 
n 15 4 3 22 
Xe 224.9 303.0 273.7 245.7 
=X 3,373 1,212 821 5,406 
Totals DX 776,923 369 , 882 229 , 133 1,375,938 
(2Xe)2/ne 758,475 367 ,236 224 ,680 1,350,391 
=" [(2Xe)?/nr] 1,330,635 
EP [(LXe)2/nre] 762,716 367 , 236 229,109 1,359,061 
(2X0)2/N 1,328,402 





* For meaning of symbols, see text. 

Note: Tables I, III, and V would be somewhat more exact if the calculations had been weighted 
by the rural farm populations of the counties. In the case of estimates as rough as these, however, the 
extra labor required did not seem justified. 


in agriculture, symbolized by the letter J, is then obtained® by dividing 
the adjusted farm population by the census rural farm population in 
1930, multiplying by 100, and subtracting from 200. (See column 7, 
Tables II, IV and VI.) Thus, an index of 120 means that the rural farm 
population of a county must be reduced by 20 per cent in order to pro- 
duce a land-man ratio equal to that of the criterion counties in the same 
land-value class and type of farming area, which of course have an in- 
dex of 100. 


7 Or, the adjusted farm population may be obtained by multiplying the observed rural farm popu- 
lation of the county by its index of population pressure, J, as described in the text, subtracted from 200 
and expressed as a decimal. 

* Or, the land-man ratio of the county may be divided by the criterion land-man ratio and the 
quotient subtracted from 2 and multiplied by 100. 
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TABLE II 
TYPE OF FARMING AREA I: OBSERVED DATA AND ESTIMATES* 




























































































‘ oe Bae ’ = o PI! 
No. County Xe Xi | X: F F.I. I Xo I =X XP 
1 | Jefferson 276 | 23.8 | 44 | 13,604 | 3,750,863 || 126 | 302 | 10,067 | 3,040,234 
2 | Outagamie | 190 | 18.8 | 44 | 17,843 | 3,398,904 || 141 | 231 | 10,527 | 2,431,737 
3 | Shawano 177 | 22.6 | 32 | 19,200 | 3,394,589 || 129 | 206 | 13,632 | 2,808,192 
4 | Columbia | 250 | 32.1 | 33 | 13,876 | 3,469,247 || 100 | 250 | 13,876 | 3,469,247 
5 | Green Lake | 243 | 33.0 | 38 | 6,248 | 1,516,849 || 100 | 243 | 6,248 | 1,516,849 
6 | Brown 184 | 17.9 | 49 | 16,534 | 3,035,274 || 134 | 311 | 10,912 | 3,393,633 
7 | Door 171 | 23.0 | 47 | 10,043 | 1,720,137 || 115 | 227 | 8,537 | 1,937,899 
8 | Fond du Lac | 251 | 23.1 | 52 | 18,737 | 4,695,548 || 115 | 306 | 15,926 | 4,873,356 
9 | Kewaunee | 188 | 21.6] 51 9,601 | 1,807,808 || 120 | 264 | 7,681 | 2,027,784 
10 | Manitowoe | 207 | 19.7 | 47 | 17,791 | 3,685,205 || 127 | 309 | 12,987 | 4,012,983 
11 | Sheboygan | 235 | 18.7 | 56 | 15,980 | 3,748,288 || 131 | 351 | 11,026 | 3,870,126 
12 | Washington | 257 | 20.2 | 51 | 12,903 | 3,312,192 || 125 | 352 | 9,677 | 3,406,304 
13 | Winnebago | 226 | 21.1 | 50 | 12,004 | 2,712,671 || 122 | 309 | 9,363 | 2,893,167 
14 | Dane 273 | 24.7 | 54 | 28,763 | 7,856,644 || 100 | 273 | 28,763 | 7,856,644 
15 | Dodge 299 | 24.4 | 54 | 21,112 | 6,316,027 || 100 | 299 | 21,112 | 6,316,027 
16 | Rock 296 | 27.1 | 47 | 15,442 | 4,570.137 || 100 | 296 | 15,442 | 4,570,137 
17 | Walworth | 344 | 27.5 | 50 | 11,380 | 3,919,726 || 100 | 344 | 11,380 | 3,919,726 
18 | Green 328 | 32.2 | 50 | 11,077 | 3,628,767 || 100 | 328 | 11,077 | 3,628,767 
19 | Calumet 229 | 19.5 | 61 | 9,864 | 2,259,041 || 100 | 229 | 9,864 | 2,259,041 
20 | Ozaukee 246 | 17.9 | 75 | 7,386 | 1,818,973 || 100 | 246 | 7,386 | 1,818,973 
21 | Racine 262 | 18.6 | 80| 9,419 | 2,466,507 || 100 | 262 | 9,419 | 2,466,507 
22 | Waukesha | 274 | 19.5 | 75 | 15,222 | 4,175,274 || 100 | 274 | 15,222 | 4,175,274 
Totals 314,078 77,267,671 || | [270, 124 \76 692,607 
Means 246t| 23.0 | 52 | 1144] 2sit| 














* Meaning of symbols used as column headings: 
X-« is deflated gross farm income per capita of the rural farm population, 1931. 
X:1is acres of farm land per capita of the rural farm population, 1930, referred to as 
land-man ratio. 
X:is the deflated value of farm land peracre, 1930. 
P is the rural farm population, 1930. 
F.I. is the total deflated gross farm income, 1931. 
I is the index of relative farm population pressure. 
X°’ is the adjusted deflated gross farm income per capita of the rural farm population, 1931 
P’ is the adjusted rural farm population, 1931. 
F’.I’. is the adjusted total deflated gross farm income, 1931. 
t Weighted by the rural farm populations of the counties. 

By this method absolute optima populations are avoided and relative 
optima, established by the counties themselves, are used that change 
as the value of farm land changes. 

Next, a rough estimate is made of the probable effect of the increased 
land-man ratios on the farm income per capita in each adjusted county 
and on the total farm income in the type of farming area. The results 
show that while increasing the number of acres per capita always in- 
creases per capita income, in each of the three types of farming areas It 
reduces the total farm income because the returns from a fixed amount 
of farm land decrease as cultivation becomes less intensive (cf. the 


F.I. and F.’I.’ columns of Tables II, IV, and VI). This appears again 
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in the fact that in Area II, for example, the coefficient of linear correla- 
tion between the land-man ratio and farm income per acre is r= —.68, 
or r?=.46. The operation of this principle is familiar in the thickly 
populated countries of Europe and Asia, where returns per acre are 
high but returns per worker are very low. 

Finally, the investigation shows that in each type of farming area 
in Wisconsin relative over-population arises in greater degree from too 
many farm families than from too large farm families. There is a slight 
tendency for farm families to be larger in counties where the farms are 
smaller and incomes lower. 

The steps in this analysis will be shown in some detail for type of 
farming Area I. In the cases of the other two areas only the end results 
will be given, except where certain points need explanation. 


TYPE OF FARMING AREA I 
Type of farming Area I, represented by Tables I and II, includes 
twenty-two counties located in the southern and eastern parts of the 
state. The dominant farm enterprise is dairying. The region is distin- 
guished by the highest gross farm income per capita of the rural farm 
population, the weighted mean® being $246 (column Xo, Table II). The 
area also has the highest priced farm land, with a range of $32 to $80 
and a mean of $52 per acre (column X¢2, Table II). The mean land-man 
ratio, however, is only 23 acres of farm land per capita of the rural farm 
population, which is smaller than that of either of the other two areas 
(column X,, Table IT). 
The relationship between per capita income (Xo) and land-man ratio 
(X;) in this area, with type of farming (X3) and value of land (X_) con- 
stant, may be found from the formula 


= ( 5 xX.) = ( > xX. 
fi“ - f= 
7701.28 = - - (1) 


Ebr - Eee 


¢c 

Ny 
where r refers to row, c to column, rc to cell, and n to frequency. In us- 
ing this formula it is well to omit (from both numerator and denomina- 
tor) any row that nas less than two cells containing two or more Xo en- 
tries each. Substituting from Table I,!° 





* Unless otherwise stated, all area means are unweighted averages of the county means. Weighting 
was usually found to make no practical difference in the results. 

10 It seemed advisable to avoid Bessel’s correction and the correction for broad grouping. The 
assumptions of the latter make it invalid in this study, and since the two corrections tend to offset one 
another, it would be misleading to use the former alone. If both were made in the usual way, the values 


of y* would be materially raised here. 
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2 1,359,061 — 1,330,635 


No1.23 = 


1,375,938 — 1,330,635 —~ 





That is, roughly 63 per cent of the variance (standard deviation 
squared) in per capita income from county to county may be attributed 
to differences in the land-man ratio and associated factors, with the in- 
fluence of differences in land values and type of farming removed. 

It is also possible to estimate the extent to which per capita farm 
income reflects the combined influence of the two factors, land-man ra- 
tio, X;, and land value per acre, Xe, with type of farming, X3, held con- 
stant. For this purpose the following formula may be devised, 


tex- 222%) 


2 
Ho.) = 1 — (2) 








Sym -—— 


where N is the total frequency of the table. In making such an estimate, 
cells that contain less than two entries should be omitted entirely. The 


value here is 

1,375,938 — 1,359,061 

(5406 — 328)? 
21 


= .58. 





2 
Howe) = 1 — 





1,375,938 — 107,584 — 


But H’o.12:3) should be greater than 779.23 if the factors X; and X,2 are 
held actually constant in the cells. As the X; and X, categories in Table 
I become narrower the value of H?o.12;3) increases. To correct H*o.19;3) for 
broad categories we may find 702.3 for the whole table 


. (Es) (Ex) 


2 











2 Nr N 
no2.3 = - : (3) 
.. , (2%) 
X: _ 
Ld Xs N 
2 1,330,635 — 1,328,402 
= = .05 





,a3s 


1,375,938 — 1,328,402 
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and then use the formula 


2 5 2 : 
Ho.12(3) = "o1.23(1 — mov.s) + 02.8 (4) 














¢c 


Ho.12@) = -63(.95) + .05 = .65. 
We therefore assume that something like 65 per cent of the variance in 
influence of the land-man ratio and land values, leaving 35 per cent un- 
explained. 
re 2 r 2 
(Ex). (Ex) 
p a - 2 
702.13 = : (5) 
(Ex) 
, 1,359,061 — 1,350,391 

1,375,938 — 1,350,391 





per capita income between the counties is attributable to the combined 
By a formula similar to (1), 
2 Nre Ne 
r c 2 
2, 2s%e- 2, 
02.13 = 


A comparison of 7792.13 =.34 with 7% 1.23 =.63 shows that the relation- 
ship between the land-man ratio and per capita income is much closer 
than that between the per capita income and the value of farm land per 
acre. 

In Table I it is seen that five counties have farm land with mean val- 
ues ranging between $30 and $45 an acre. Three of the counties, Sha- 
wano, Outagamie, and Jefferson, have a mean land-man ratio of 21.7 
acres and a mean per capita income of $214. The other two counties, 
Green Lake and Columbia, have a mean land-man ratio of 32.6 acres 
and a per capita income of $247. If, in view of our correlation analysis 
above, we attribute the difference of about $32 in per capita income be- 
tween these two groups of counties to the difference in their land-man 
ratios (and unknown associated factors), we may estimate that each in- 
crease of one acre in land-man ratio is accompanied by a mean gain of 
246.5 —214.3 


32.0 —21.7 
land-man ratio of the three “iow” counties to 32 acres, we divide 32 into 
the land-man ratio of each of these counties, i.e., into 22.6 in the case of 
Shawano, into 18.8 in the case of Outagamie, and into 23.8 in the case 
of Jefferson, and subtract the resulting percentage from 200 in each 
case. We then obtain as the county indexes of relative farm population 
pressure in this land-value class: Jefferson 126, Outagamie 141, Sha- 





roughly $3.13 in per capita income ( ). To raise the mean 
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wano 129 (column J, Table IT). That is, the rural farm population of Jef- 
ferson county would have to be reduced 26 per cent in order to provide 
32 acres of farm land for the support of each individual member. Co- 
lumbia and Green Lake, the criterion counties in this group, each has a 
population pressure index of 100, and their farm populations are left 
unchanged. When the farm populations of Jefferson, Shawano, and 
Outagamie counties are reduced in this way, the adjusted farm popula- 
tion for each of the three counties may be found by multiplying its 1930 
rural farm population by the difference between 200 and the county’s 
index of population pressure, expressed as a decimal. For example, for 
Jefferson county, with a rural farm population of 13,604 in 1930, the 
adjusted population is 13,604 (2.00 — 1.26)=10,067. For the other 
two counties the 1930 rural farm populations and their adjusted values 
are: Outagamie, 17,843 reduced to 10,527; Shawano, 19,200 reduced to 
13,632 (column P’, Table IT). 

If such adjustments in the land-man ratios of Jefferson, Outagamie, 
and Shawano counties were made, we infer that each county would 
probably have its per capita farm income increased by an amount equal 
roughly to the increase in its land-man ratio multiplied by the figure 
$3.13 obtained above. Thus, in the case of Jefferson county, the land- 
man ratio is increased from 23.8 acres to 32 acres, or 8.2 acres, so that 
the gain in its per capita income is 8.2 X3.13 =$25.67. Since the ob- 
served per capita income of Jefferson is $276, its adjusted per capita 
income is 276+25.67 = $301.67, or about $302. Similarly, in Outagamie 
county the per capita income is raised from $190 to $231, and in Sha- 
wano county from $177 to $206 (column X9’, Table IT). 

By repeating these simple calculations for each land-value class in 
Table I, Table II (to the right of the upright double ruling) was worked 
out for the twenty-two counties in Area I. The land-man ratio of each 
of the counties numbered 6 through 13 in Table II was raised to 27 acres, 
and the addition of each acre was estimated to increase the per capita 
income by $14.00. 

The weighted index of farm population pressure for the whole of 
Area I is 114 (Table ITI). It is found by dividing the total observed rural 
farm population (P) into the total adjusted farm population (P’), and 

270,124 


314,078 
ble II). That is, a reduction of 14 per cent in the rural farm population 
of this prosperous dairy section in southern and eastern Wisconsin is 
needed to raise the land-man ratios in all counties to a par with the land- 
man ratios observed in the counties with the highest land-man ratios in 
their respective land-value classes. A population of about 44,000 (i.e., 


subtracting from 200, or 200— = 114 (columns P and P’ in Ta- 
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314,078 X.14) would be released from the farms, chiefly through migra- 
tion to the cities, partly through birth control. As a result of such a 
reduction in the rural farm population, it is likely that the total farm 
income for the area would be decreased from $77,267,700 to $76,692,600 
(columns F.J. and F.’I.’, Table II), a loss of $575,000, or a little less than 
one per cent. The weighted per capita income of the remaining farm pop- 
ulation, however, would be raised about $38, or 15 per cent, as a result of 


77,267,671 _ 76,692,607 
an increase from $246 (ic, —) to $284 (ic, Snr) 
314,078 270,124 


(columns P, F.J., P’, and F.’J.’, Table II). 


TYPE OF FARMING AREA II 


Type of farming Area II (Tables III and IV) lies in southern and 
western Wisconsin, and also contains twenty-two counties. It is a region 
of dairy, livestock, and mixed farming. Farm land ranges in mean value 
from $28 to $46 an acre, with a mean for the area of $35 an acre. The 
weighted gross farm income per capita of the rural farm population is 
$199. It is somewhat poorer than Area I but more prosperous than 
Area III. The land-man ratio averages 28.8 acres per capita of the rural 
farm population, compared with 23 acres in Area I. The values 702.13 
= .33; 7°o1.23 = -68; and H*p 12:3) =.71 closely resemble the corresponding 
values found for Area I, and again indicate a strong relationship be- 
tween per capita farm income and the land-man ratio. The estimates for 

TABLE III 


TYPE OF FARMING AREA II, TWENTY-TWO WISCONSIN COUNTIES: 
METHOD OF ANALYSIS* 





























Xs Xi 20-25 26-31 32-37 38-41 Totals 
n 7 7 1 2 17 
Xo 181.1 203.9 196.0 245.0 198.9 
25-39 EX 1,268 1,427 196 490 3,381 
rXo? 231,394 292,999 38,416 120,212 683 ,021 
(2X0)2/n 229 ,689 290 , 904 38,416 120,050 672,421 
n 1 2 2 0 5 
Xo 181.0 224.0 245.5 224.0 
rXe 181 448 491 1,120 
40-49 DXo? 32,761 101,152 120,292 254 , 205 
(=Xo0)2/n 32,761 100 ,352 120,541 250,880 
n 8 9 3 2 22 
Xo 181.1 208.3 229.0 245.0 205.0 
rXe 1,449 1,875 687 490 4,501 
DX? 264,155 394,151 158,708 120,212 937 , 226 
Totals | (2Xo)2/ne¢ 262,450 390,652 157 ,323 120,050 930,448 
D"[(2Xo0)2/ny] 923 ,301 
Ee [(DXo)2/nre] 262 , 450 391,256 158 , 957 120,050 932,713 
(=Xe)2/N 920 , 864 




















* For meaning of symbols, see text. 
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TABLE IV 
TYPE OF FARMING AREA II: OBSERVED DATA AND ESTIMATES* 











30 | 15,018 3,208,562 || 129 | 257 | 10,663 | 2,740,391 
31 | 15,100 3,102,466 || 12 245 | 11,174 | 2,737,630 
28 9,323 1,824,384 |} 112 | 214 8,204 | 1,755,656 


13 St. Croix 214 | 28. 
14 Trempeleau | 205 | 29 
15 Crawford 196 | 35. 


F - Fae 

No. County Xe Xi X: P F.I. I Xe i =X,’ XP’ 
1 Barron 189 | 21.6 | 36 | 21,916 4,012,329 || 146 | 258 | 11,446 | 2,953,068 
2 Chippewa 183 | 25.8 | 33 | 18,131 3,326,164 || 135 | 236 | 11,785 | 2,781,260 
3 Clark 187 | 20.5 | 37 | 24,533 4,588 151 || 149 | 260 | 12,512 | 3,253,120 
4 Marathon 162 | 20.7 | 35 | 34,455 5,568,151 |} 148 | 234 | 17,917 | 4,192,578 
5 Pierce 211 | 25.5 | 37 | 14,432 2,827,877 || 136 | 265 9,236 | 2,447,540 
6 Polk 167 | 23.5 | 34 | 18,953 3,173,630 || 141 | 229 | 11,182 | 2,530,678 
7 Wood 169 | 22.3 | 37 | 13,725 2,319,452 || 144 | 213 7,686 | 1,637,118 
8 Dunn 204 | 28.8 | 34 | 16,899 3,441,986 || 128 | 246 | 12,167 | 2,993,082 
9 Eau Claire | 181 | 28.5 | 31 | 10,352 1,878,082 || 129 | 224 7,350 | 1,646,400 
10 La Crosse 236 | 30.8 | 32 8,877 2,091,438 |} 123 | 271 6,835 | 1,852,285 
11 Monroe 204 | 29.4 | 32 | 15,833 3,223,356 || 122 | 244 | 12,350 | 3,013,400 
12 Pepin 183 | 28.8 | 31 4,677 853 ,836 || 128 | 225 3 ,367 757 ,575 

5 

4 

2 

3 

5 



























































16 | Buffalo 254 | 40.3 | 28 | 10,134 | 2,572,740 || 100 | 254 | 10,134 | 2,572,740 
17 Iowa 236 | 39.5 | 39 | 11,557 | 2,725,274 || 100 | 236 | 11,557 | 2,725,274 
18 | Vernon 181 | 25.5 | 42 | 19,193 | 3,476,233 || 125 | 229 | 14,395 | 3,296,455 
19 Richland 204 | 29.5 | 40 | 12,146 | 2,479,452 || 113 | 229 | 10,567 | 2,419,843 
20 | Sauk 244 | 30.2 | 40 | 16,232 | 3,953,082 || 111 | 265 | 14,446 | 3,828,190 
21 Grant 234 | 34.6 | 44 | 19,594 | 4,578,699 || 100 | 234 | 19,594 | 4,578,699 
22 Lafayette | 257 | 34.1 | 46 | 10,995 | 2,843,425 | 100 | 257 | 10,995 | 2,843,425 
Totals | | | |341,355 | 68,068,769 | 245,562 |59,586,407 
Means 199+] 28.8 | 35 | | 1284) 243t 











* For meaning of symbols, see Table II. 
t Weighted by the rural farm populations of the counties. 


Area II are collected in Table IV. A land-man ratio of 40 acres was 
taken as the criterion for counties numbered 1 through 17; while for 
counties numbered 18 through 22 the criterion was 34 acres. In the first 
case the addition of each acre to the land-man ratio increased the per 
capita income by about $3.75, in the second case by about $5.62. 

Area IT shows a weighted index of farm population pressure of 128, 
compared with 114 for Area I. To reduce this index to 100 would require 
the removal of some 95,800 people from the farms of the twenty-two 
counties. Such a reduction in the farm population of the area would be 
accompanied by a loss in total gross farm income estimated at 12 or 13 
per cent ($8,482,000). On the other hand, the weighted per capita in- 
come would be increased 22 per cent, from $199 to $243. 


TYPE OF FARMING AREA III 
The third and last type of farming area, covered by Tables V and VI, 
includes twenty-five counties. Nineteen of these counties fall in the 
northern part of the state, six in the central part. The area includes the 
northwestern sandy dairy-potato and dairy-hay regions, the central 








TABLE V 


TYPE OF FARMING AREA III, TWENTY-FIVE WISCONSIN COUNTIES: 
METHOD OF ANALYSIS* 









































Xs Xi 14-19 20-25 26-31 32-37 38-47 Totals 
n 3 5 3 1 2 14 
Xs 83.0 118.4 144.3 170.0 153.5 125.1 
10-24 | =Xe 249 592 433 170 307 1,751 
=X 20,981 71,110 64,769 28,900 48,220 | 233,980 
(2X0)2/n 20 ,667 70,093 62,496 28,900 47,125 | 219,100 
n 3 7 1 0 0 11 
Xe 116.0 136.1 165.0 133.3 
=Xe 348 953 165 1,466 
25-39 | 2X 43,322 | 135,863 27 ,225 206 ,410 
(2X0)*/n 40,368 | 129,744 27 ,225 195,459 
n 6 12 4 1 2 25 
XxX. 99.5 128.8 149.5 170.0 153.5 128.7 
zXo 597 1,545 598 170 307 3,217 
rXe 64,303 | 206,973 91,994 28,900 48,220 | 440,390 
Totals | (2Xe)2/ne 59,400 | 199,068 89,400 28,900 47,125 | 423,893 
2° [(2Xe)*/ne] 414,559 
zr [(EX0)2/nre] 61,035 | 199,837 89,721 28,900 47,125 | 426,618 
(2X-0)2/N 413,964 
* For meaning of symbols, see text. 
TABLE VI 


TYPE OF FARMING AREA 


III: OBSERVED DATA AND ESTIMATES* 


















































No County | Xe| X: |X| P FI. 1 |x| P | “ade pe 
1 | Forest 84] 19.6 | 22] 3,116] 263,219 || 156 | 125| 1,371 | 171,375 
2 | Iron 70 | 14.7| 22] 2,995] 210,411 || 167 | 119 988 | 117,572 
3 | Price 95 | 19.1] 19] 9,561] 910,890 || 158 | 137 | 4,016 | 550,192 
4 | Ashland 123 | 21.6 | 23} 5,086 624,109 || 152 | 161| 2,441| 393,001 
5 | Bayfield 126 | 23.2 | 22} 8,338 | 1,053,014 || 148 | 161 | 4,336 | 698,096 
6 | Florence 126 | 24.3 | 20} 1,664] 209,315 || 146 | 160 so9 | 143,840 
7 | Marinette | 127 | 21.7 | 22 | 12,012 | 1,522,877 || 152 | 165 | 5,766 | 951,390 
8 | Sawyer 90 | 22.9] 23| 4,467| 403,973 || 149 | 126 | 2,278 | 287,028 
9 | Burnett 134 | 27.4 | 21] 7,574 | 1,014,041 || 139 | 163 | 4,620] 753,060 
10 | Jackson 182 | 31.1 | 23 | 11,244 | 2,042,740 || 131 | 205 | 7,758 | 1,590,390 
11 | Washburn | 117 | 28.7| 21} 5,860] 683,151 || 136 | 143 | 3,7 536 ,250 
12 | Waushara | 170 | 34.4 | 21 | 9,527 | 1,618,630 || 124] 187 | 7,241 | 1,354,067 
13 | Adams 130 | 46.7 | 13} 5,524] 718,699 || 100 | 130| 5,524| 718,699 
14 | Marquette | 177 | 43.6 | 18 | 5,781 | 1,020,479 || 100 | 177 | 5,781 | 1,020,479 
15 | Langlade 143 | 18.9 | 35 | 9,284 | 1,320,479 || 139 | 188 | 5,663 | 1,064,644 
16 | Taylor 133 | 19.7 | 28 | 12,519 | 1,666,849 || 136 | 175 | 8,012 | 1,402,100 
17 | Vilas 72 | 16.2 | 27| 2,298| 166,370 || 148] 127| 1,195 | 151,765 
18 | Douglas 111 | 21.6 | 28] 7,870] 873,836 || 130 | 146 | 5,509 | 804,314 
19 | Lineoln 134 | 23.3 | 29} 8,195 | 1,101,781 || 125 | 163 | 6,146 | 1,001,798 
20 | Oconto 141 | 20.6 | 31 | 14,812 | 2,087,534 || 134 | 180 | 9,776 | 1,759,680 
21 | Oneida 104 | 25.2 | 27| 3,936] 407,945 || 119 | 126 | 3,188 | 401,688 
22 | Portage 127 | 25.8 | 28 | 16,256 | 2,063,904 || 117 | 146 | 13,492 | 1,969,832 
2 Rusk 134 | 20.3 | 26] 9,844 | 1,315,959 || 135 | 174 | 6,399 | 1,113,426 
24 | Waupaca 202 | 25.1 | 35 | 16,297 | 3,299,658 || 119 | 224 | 13,201 | 2,957,024 
25 | Juneau 165 | 31.4 | 26 9,690 | 1,597,329 || 100 | 165 | 9,690 | 1,597,329 

Totals 203,750 |28,197,192 139,040 {23,509,039 
Means | 1381] 25.1 | 24 132t| 169+ 


























* For meaning of symbols, see Table II. 
t Weighted by the rural farm populations of the counties. 
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sandy general farming region, and the central sandy-loam dairy-potato 
region. Farm land values are much lower than in either of the other two 
areas, ranging from $10 to $39 per acre, with a mean of $24 per acre. 
The weighted per capita farm income, only $138, is also the smallest 
among the three areas. The mean land-man ratio of 25 acres falls be- 
tween the land-man ratios of Areas II and I. The values of the correla- 
tion ratios are somewhat less than those for the other two areas: 776.23 
= 49; n7o2.13=.19; Ho.12:3) =.50. However, there is again a substantial 
relationship between per capita farm income and land-man ratio. The 
criteria used were: among counties numbered from 1 through 14 in Ta- 
ble VI, a land-man ratio of 45 acres, with an increase in per capita in- 
come of $1.62 for each acre added to the land-man ratio; among coun- 
ties numbered from 15 through 25, a land-man ratio of 31 acres, with 
an increase of $3.73 in per capita income for each acre added to the land 
man ratio. 

The weighted index of farm population pressure for Area III is 
132, which is the highest among the three areas. Some 64,700 persons 
would have to be withdrawn from the rural farm population in order 
that all counties in the area might attain the land-man ratios of the 
criterion counties. Such a population adjustment would result in an es- 
timated reduction of about 17 per cent ($4,688,000) in the total gross 
farm income, but would raise the weighted mean farm income per cap- 
ita 22 per cent, from $138 to $169. 


CHANGES IN COUNTY FARM POPULATIONS, 1930-1940, IN RELA- 
TION TO 1931 INDEXES OF POPULATION PRESSURE 


Between the censuses of 1930 and 1940 the rural-farm populations 
of the counties with indexes of population pressure greater than 100 in 
1931 decreased about 3 per cent on the average in Area I and also in 
Area II; whereas the farm population of the criterion counties with in- 
dexes of 100 increased about 3 per cent in Area I and remained prac- 
tically stationary in Area II. In type of farming Area III the farm 
population of the counties with indexes of population pressure above 
100 increased nearly 5 per cent over the same period, while that of the 
criterion counties increased only about one per cent. 

These figures suggest that in Area I adjustments in the size of the 
farm population have recently taken place that are tending to equalize 
population pressure on agricultural resources throughout the area at an 
average level above that of the criterion counties but below that of the 
counties with population indexes in excess of 100 in 1931. In type of 
farming Area II the slight recession of farm population in the criterion 
counties and the more marked decline in the other counties is also work- 








- MEASUREMENT OF FARM POPULATION PRESSURE IN WISCONSIN 177 


ing toward equalization of pressure, but at a level below that of the 
criterion counties in 1931. By contrast, in Area III the inequalities of 
1931 between the criterion counties and the more crowded counties 
have been increasing. This is further evidence that the major problem of 
population pressure on agricultural resources is to be found in the 
northern and central parts of the state, particularly in the so-called 
“Cut-Over” region. 


SIZE OF FARM VS. SIZE OF FARM FAMILY IN RELATION TO 
INDEX OF FARM POPULATION PRESSURE 


When the indexes of farm population pressure are linearly correlated 
with the percentage of young people under twenty years of age" in the 
rural-farm population in 1930 by counties in each type of farming area, 
and the three coefficients so obtained are averaged, we find 7? = .25. 
Similarly, between the indexes of population pressure and size of farm 
family’ the mean # = .14; and between the indexes of population pres- 
sure and mean size of farm™ the mean # = — .91, or # = .83. If we repre- 
sent the indexes of population pressure by X,, the size of farm by Xz, 
and the size of farm family by Xz, and obtain coefficients of linear par- 


tial correlation, we find #55 = — .86, or #45.6 = .74; and 4.5 = .49, or 
744.5 = .24. Finally, the simple linear correlation between size of farm 
and size of farm family is mean # = — .20, or #? = .04. The interpretation 


of these coefficients has been given above under “General Method and 
Results.” 


11 1930 Federal Census, Population, Vol. III, Pt. 2, p. 1307. 
12 [bid., Agriculture, Vol. II, Pt. 1, p. 736. 
3 Ibid., Population, Vol. VI, Families, p. 1468. 


Nore: It will be noticed that all of the data used in this study were taken from the 1930 Federal 
Census, except the gross farm income by counties, which was furnished by the Wisconsin Department 
of Agriculture and Markets. Unfortunately, reliable estimates of farm income by counties are rarely 
available in any state, including Wisconsin. A fairly good substitute, however, may now be obtained 
from the Federal Census. This is the value of all farm products sold, traded, or used by farm households 
(Agriculture, Third Series, County Table XVII, 1940). The squared coefficient of correlation between 
these values of farm products and the gross farm incomes (Xo) used in this study, both on a per capita 
basis, is r? = .81, as already mentioned. 

A study dependent on county data drawn solely from the Federal Census and publications of a 
state department of agriculture, of course, should not be used alone to recommend or guide any actual 
adjustments of the farm population to the land resources of a county or state. When such adjustments 
are made, as in the case of the resettlement projects of the Federal Government in northern Wisconsin, 
such studies as this should be supplemented by intensive economic, farm management, and sociological 
studies conducted in the field. 











THE PRODUCTION FUNCTION FOR 
CANADIAN MANUFACTURES* 


By Patricia Dar 
University of Chicago 
AND Paut H. Dovetas 
Captain, U.S.M.C.R., and University of Chicago 


EADERS Of this JoURNAL may perhaps be somewhat acquainted 
R with a series of articles by one of the present authors and various 
associates which on the basis of (1) time-series and (2) cross-section 
studies of labor, capital, and product have sought to discover by use of 
the production formula, P=bL‘Ci, the effects upon the quantity of 
manufacturing product caused by changes in the quantities of labor 
and capital. Thus far the economies studied have been the manufactur- 
ing industries of the United States! and Australia.? The present study 
gives empirical values to the parameters of the above function for the 
Canadian manufacturing industries. Four years have been chosen for 
the cross-section analysis,’ namely, 1923, 1927, 1935, 1937, care being 
taken to avoid depression years and to seleet years which were rela- 
tively “normal.” 

In each of these given years, each manufacturing industry was 
treated as an observation with its respective quantities of Labor (JL), 
Capital (C), and Product (P). The definitions given to these terms were 
as follows: 


1. Labor (LZ) =total number employed, i.e., wage-earners and sal- 
aried workers. 

2. Capital (C) =total capital, including both (a) fixed capital in 
plant, tools, machinery, etc., and (b) working capi- 
tal including materials, goods in process, etc. 


* The authors wish to thank their friends and colleagues, Miss Grace T. Gunn and Mr. H. Gregg 
Lewis for numerous helpful suggestions. We also wish to acknowledge aid given, in computing by E. 
Olson, B. Nimer, and W. L. Slayton, and in drawing of the charts by Mr. Y. K. Wong. 

1 Paul H. Douglas, The Theory of Wages, pp. 113-167; M. Bronfenbrenner and Paul H. Douglas, 
“Cross-Section Studies in the Cobb-Douglas Function,” Journal of Political Economy, XLVJi (Decem- 
ber, 1939), pp. 761-785; Grace T. Gunn and Paul H. Douglas, “The Production Function for American 
Manufacturing in 1919,” American Economic Review, XXXI (March, 1941), pp. 67-30; and “The 
Production Function for American Manufacturing for 1914,” Journal of Political Economy, L (August, 
1942), pp. 595-602; Patricia Daly, Ernest Olson, and Paul H. Douglas, “The Production Function for 
Manufacturing in the United States, 1904,” Journal of Political Economy (forthcoming). 

2 Grace T. Gunn and Paul H. Douglas, “Further Measurements of Marginal Productivity,” Quar- 
terly Journal of Economics, LV (May, 1940), pp. 399-428; and “The Production Function for Aus- 
tralian Manufacturing,” ibid., LVI (November, 1941), pp. 108-129; Paul H. Douglas, The Theory of 
Wages, pp. 167-172; Paul H. Douglas and Marjorie L. Handsaker, “The Theory of Production as De- 
rived from Australian Data,” Quarterly Journal of Economics, LII (November, 1937, and March, 1938) 
pp. 1-36, 215-54. 

* For the sources, see the Canada Year Book for 1925, 1930, 1938, and 1940. 
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3. Product (P) =gross sales value of product minus cost of materials 
used, fuel, electricity, etc. 


The equation P=bL*C/ can be rewritten as a linear one by using the 
logarithmic values of the variables. The equation then becomes 


log P = logb + klogL + 7 log C. 


Fitting this equation by the method of least squares, the values in Ta- 
ble I are obtained. 

It will be noticed: 

1. That the sum of the exponents for labor and capital (i.e., k+ 7) 
closely approximate unity or 1.0. This is what has been found to be ap- 
proximately true in virtually all of the other studies and strengthens 
the conclusion that the production function is homogeneous and of the 
first degree. This corroborates the deductive conclusion drawn that 
under competition production is carried to the point of lowest cost on 
the average total unit cost curve which is the point of constant costs. 

2. That the exponents of both labor (k) and capital (7) are relatively 
steady. The span in values of k was from .43 to .50 which was less than 
the range of one standard error from the intermediate values. The span 
in values of j was from .48 to .58. 

3. The values of k are somewhat less and the values of 7 somewhat 
more than those for the United States found for fcur particular years 
of the period 1904-1919 which were as follows: 


Year k i 

1904 65 31 
1909 74 .32 
1914 -61 .36 
1919 76 .25 


This is not surprising. We have never claimed that the exponents of 
the factors should be the same for all economies and periods. Indeed in 
The Theory of Wages, it was explicitly stated that one might expect 
such variations.‘ In the United States where capital per worker is 
appreciably greater we might expect capital to have a lower and labor 
a higher exponent than in Canada. 

It is also interesting to note that the exponents of labor and capital 
for New Zealand as shown by a time study by Dr. Max Brown seem to 
approximate those of Canada.5 


‘See, Paul H. Douglas, The Theory of Wages, p. 203. 

5 The results of this study are given by Colin Clark in his book, Conditions of Economic Progress, 
p. 384. The material is contained in an unpublished thesis at Cambridge University, which through the 
courtesy of Dr. Brown we have had the privilege of reading. 
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We would conclude that the elasticity of the marginal productivity 
curve for labor in Canadian manufacturing for the four years studied 
was probably slightly under 2.0 and that for capital slightly over 2.0 
(i.e., under competitive conditions the elasticity of this curve is equal 
to the reciprocal of the exponent of the other factor). 

A test of the accuracy of the production function is the following: (1) 
compute for each industry from its actual quantities of Labor (L) and 
Capital (C) what the Product should be according to the formula. We 
shall call this P’; then (2) compare the actual product (P) with this 
computed product (P’) and see how great are the relative differences 


TABLE I 


VALUES OF EXPONENTS OF LABOR AND CAPITAL IN THE PRO- 
DUCTION FUNCTION FOR CANADIAN MANUFACTURES* 











Year No. of Industries k Ck j oj b k+j 
1923 167 .48 .038 -48 -035 48.5 .96 
1927 163 .46 .039 .52 -036 33.0 .98 
1935 165 -50 -044 -52 .040 22.2 1.02 
1937 164 43 041 .58 .036 15.4 1.01 
Average for four years .468 -525 .992 





* If we use fixed capital only for C, the exponents are changed to the following: 


Year k j b k+j 
1923 54 .44 82.0 .98 
1927 .49 .49 58.9 .98 
1935 .53 .49 33.5 1.02 
1937 .48 -53 32.9 1.01 
Average .610 .488 .998 


of the former from the latter and whether they can be explained by ran- 
dom errors of measurement or by other random causes operating upon 
the value product of specific industries. 

This has been done for the years 1923, 1927, and 1935; the results 
being shown graphically in Charts I to III inclusive. The logs of the 
actual product or P are shown on the horizontal axis or abscissa; the 
logs of the computed product or P’ upon the vertical axis or ordinate. 
Were there to be perfect correspondence between the two, each obser- 
vation would fall on the line BB’. The differences between the actual 
product (P) and those which we would expect under the formula (P’) 
are measured by the horizontal deviations of the observations from the 
line BB’. To measure these more accurately two sets of bands have been 
laid out on the charts. The first is at a distance of one standard error of 
estimate to the right and to the left of the line BB’ while the second set 
of bands is at a distance of from one to two standard errors of estimate 
from the line of expected relationship. 
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It will be noticed that the observations cluster closely around the 
line BB’ and that the scatter is not great. This is further evidence that 
the production formula furnishes a good description of the product in 
these manufacturing industries. In Table II there is shown the per- 
centage of observations deviating from the line of perfect correspond- 
ence by less than one and two, and by more than two standard errors 
of estimate of P’. Chart IV is a graph of the results for 1937. 


CHART I 
THEORETICAL AND OBSERVED VALUES OF LOG P FOR 1923 
Loe Pp’ 2’ 
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We should not expect that there would be a precise agreement in 
each industry between P and P’. The modern theory of causation does 
not require any such one to one relationship. It is enough that given 
causal forces express themselves in “norms” of results or tendencies 
accompanied by a more or less symmetrical range of variations result- 
ing from errors of sampling, errors of measurement, and sheer chance. 
That is one of the implications of the theory of probability. If the actual 
results do approximate the norms fairly closely and the variations are 
rather symmetrically distributed, then credence in the formula as both 
descriptive and predictive is greatly strengthened. 

In the present case we would not expect deviations due to sampling 
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errors because of the fact that the quantities of L, C, and P are all de- 
rived from an all inclusive census and not from any sample. Some dif- 
ferences between P and P’ would however inevitably result from: 

(1) Errors of measurement. 

(2) Shifts over time in the demand schedules for the end products of 
the industries causing some to move to the right and others to the left 


CHART II 
THEORETICAL AND OBSERVED VALUES OF LOG P FOR 1927 
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and hence affecting the money values of the products. Here it should be 
noted that under the quantity theory of money, if the supply of money 
and bank credit is constant, the average elasticity of demand is equal 
to unity. So is its reciprocal, the average flexibility of price. The varia- 
tions in P caused by shifts of the demand curves will under these condi- 
tions tend to be mutually compensatory and hence will operate in a 
non-biased manner. The deviations from the norm caused by such 
shiftings will have, therefore, some tendency towards symmetry. 

(3) Differences in the technical effectiveness of labor and capital in 
specific industries. We have never maintained that labor and capital 
must have the same exponents in each and every industry, but have 
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TABLE II 
DEVIATIONS OF ACTUAL PRODUCT (P) IN CANADIAN MANUFACTURING 
FROM PRODUCT (P’) COMPUTED UNDER THE FORMULA FOR THE 
YEARS 1923, 1927, 1935, AND 1937 








Percentage of observations of given deviations from P’ 





You Within +18 Within +28 Outside +28 
Per cent Per cent Per cent 
1923 72.46 92.81 7.19 
1927 74.23 95.09 4.91 
1935 72.17 95.76 4.24 
1937 76.83 93.90 6.10 





instead always pointed out these may well differ. The empirical ex- 
ponents which we find are instead merely norms or averages of these 
varying exponents. It would be expected, therefore, that the actual 
products from industry to industry would not jibe precisely with the 
computed products based on the use of these “average” exponents of 
labor and capital. But, since the exponents for manufacturing industry 
as a whole are the norms or averages for the specific industries, the 


CHART III 
THEORETICAL AND OBSERVED VALUES OF LOG P FOR 1935 
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variations of these actual exponents of labor and capital in the indi- 
vidual industries would again be more or less symmetrically distributed 
and would not be biased. 

(4) A fourth factor in causing differences between actual and com- 
puted P would be forces affecting production other than the quantities 
of labor and capital. 


CHART IV 
FREQUENCY DISTRIBUTION OF DEVIATIONS OF P FROM P’ FOR 1937 
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(5) Finally, sheer chance itself may also be expected to play a part 
in causing individual deviations from the norms. 

But, and this is crucially important, to the degree that these other 
causes operate at random and not in a biased manner, then the values 
of the exponents of labor and capital in the production function are the 
main determinants of the end result, namely, the value of the product. 
Under these conditions, we would expect that in two-thirds of the cases, 
the actual product would not deviate from the theoretical product by 
more than one standard error and that in 95 per cent of the cases, these 
deviations would not be more than two standard errors. 

In practice we find that about 74 per cent of the observations fall 
within one standard error of the theoretical production values, and 
that in approximately 94 per cent of the cases, the deviation is less 
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than two standard errors. The relative differences between the actual 
and computed values of P are therefore at least not more, and in part 
actually less than would be expected under the theory of normal prob- 
ability. Credence in the developed production function as a description 
of the relative effects of labor and capital upon product is thereby 
strengthened. The variations from it are normal. 

A study of the relatively small number of industries where the devia- 
tions of actual from theoretical products were more than two standard 
errors of estimate shows that in the case of charcoal and coke, the ac- 
tual product fell short of the theoretical by this amount in no less than 
three out of the four years; so too did clay products, linen goods, agri- 
cultural implements, and petroleum products in two out of the four 
years. On the other hand the actual values of abrasive products ex- 
ceeded the theoretical values by more than two standard errors in three 
out of the four years. Also the actual values of cigars and cigarettes 
exceeded the theoretical values by more than two standard errors in 
two out of the four years. In view of the high degree of monopoly pre- 
vailing in this industry this is not surprising. Similarly in the case of 
breakfast foods, the money values of the actual products also exceeded 
the theoretical values by more than two standard errors in two of the 
years. In view of the importance of trade-marked brands creating a 
state of highly imperfect competition this again is what should be ex- 
pected. Another industry where in two of the years the actual products 
were more than two standard errors above the theoretical values was the 
liquor industry.® , 

Some of the industries for which in one of the four years the actual 
value of product differed from the theoretical by more than two stand- 
ard errors were toys, chopping mills, coal tar, fly paper, signs, paper 
patterns, bicycles and motorcycles, fur dressing and dyeing, maple 
syrup and sugar, fertilizers, musical instruments, fountain pens, non- 
ferrous metals, sand lime brick, and wood distillation. On the whole a 
study of the deviations gives added proof of the general reliability of 
the formula and of its values for the four years studied. 

A further test of the function can be made by comparing the actual 
share of the value product received by labor with that which we would 
expect to be received with the production formula. Under conditions of 
perfect competition one would expect labor to receive as its proportion- 
ate share of the total product the proportion represented by its own 


- - 


exponent k or more properly by the ratio ——-;——— being a more proper 
k+j k+) 


® Distilled liquors in one year and malt liquors in another. 
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representation of labor’s share of product under conditions of increasing 
or decreasing returns where k+ is greater than or less than one. 
In Table III we make this comparison, the share of labor being rep- 














TABLE III 

¥ W/P k ° 
ear — 
k+J 

1923 .50 .48 .50 
1927 .48 .46 .47 
1935 -40 -50 .49 

1937 -52 .43 .43 
Average -475 -468 -472 





resented by W/P or the ratio of total wages and salaries to the net 
value of the product. Taking the four years as a whole there was an 
almost precise correspondence between the share which labor actually 
received and that which we would expect it to receive. In the first two 
years, this agreement was very close, while in the last two years there 
were compensating differences. In 1935, the share which labor received 
was from 9 to 10 per cent less than one would expect from the formula 
while in 1937 it was 9 per cent more. 

Taken in the large, the results of our Canadian studies seem to 
strengthen the case for the use of the production function. 








THE COMPOSITION OF INCOME AND OWNERSHIP OF 
CAPITAL BY INCOME CLASSES IN THE 
UNITED STATES IN 1936 


By Rourvs 8S. Tucker 
General Motors Corporation 


UCH CURRENT CONTROVERSY revolves around the point of the dis- 
M tribution of income and wealth, and the alleged necessity of chang- 
ing it, by taxation or other means. For that reason any statistical in- 
formation concerning their distribution, if it is fairly close to accuracy, 
should be helpful as a substitute for prejudices and assumptions. A 
good deal of information relating to the source of income in the vari- 
ous income classes can be gleaned from the recently published special 
studies of income tax returns for 1936, filed in accordance with the in- 
come tax laws of the United States and of the states of Wisconsin and 
Delaware. The chief defects of the Federal statistics—the classification 
by statutory net income and the large proportion of persons not re- 
quired to file returns—are remedied by the state statistics, which are 
classified by total income less capital gains and losses, and which cover 
a larger proportion of the population. Delaware, in fact, requires re- 
turns from every adult, irrespective of the size of his income. 

Wisconsin and Delaware are perhaps more prosperous than the av- 
erage state. Since, however, one is mainly agricultural and the other 
mainly industrial, they are not likely to deviate in the same direction 
from the national average in the matter of the composition of income 
within a given income class. For example, it is not likely that both 
would show a smaller proportion of the income of the $1,500-$2,000 
class derived from labor, or a larger proportion from property, than the 
average of all the other states, although they might both show that a 
larger or smaller proportion of all persons fell within the $1,500-$2,000 
class than would be the case in other states. For families with incomes 
under $2,500 and single individuals with incomes under $1,000, con- 
clusions derived from these state statistics are more likely to be correct 
than those derived from the Federal statistics, since the coverage is 
better and there is less bias on account of filing requirements and other 
legal technicalities. 

From these statistics it is possible to calculate the proportion of per- 
sons in each income class who received various kinds of income, and the 
proportion of their income received from each of the sources. Applying 
these proportions to the estimated numbers of persons and of income in 
each class we can build up aggregates showing the distribution of each 
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kind of income, hence the ownership of various kinds of capital. These 
figures can be worked out separately for families and single individuals; 
in some cases separately for single men and women. 

The family figures can be supplemented by the results of the Con- 
sumer Purchases Study, which were published in a large number of bul- 
letins by the Bureau of Labor Statistics and the Bureau of Household 
Economics, and summarized to some extent in the National Resources 
Committee volumes called Consumer Expenditures and Family Expendi- 
tures. Some of these data refer only to non-relief families and may there- 
fore overstate the importance of some forms of property income in the 
lower income classes. But if corrected, as they can be in many cases, for 
relief families, they may tend to understate property incomes, because 
they were derived to a large extent from families that either were 
poorer than the average or else understated their income from property.! 

The CPS data do not state separately profits from individually owned 
enterprises such as farms, although they make a general distinction be- 
tween farmers (some of whom earned some wages) and wage earners 
(some of whom made some profits). Delaware is not so much of a farm- 
ing state as most, and in addition farmers were believed to be most 
likely to neglect to file returns. Also in all states business and profession- 
al men (including farmers) are generally supposed to be most likely to 
understate their incomes, as there is less check on net profits than on 
other forms of income. Consequently the ratios derived from tax re- 
turns probably underestimate the importance of profits in all income 
classes except those large enough to cause the Treasury to make a 
special audit. No correction on this account has been possible. Wages, 
partnership income, rent and interest were reported at the source 
if they exceeded the amount of the individual or family exemption, 
and are therefore fully represented in the Federal individual tax 
returns above say $3,500 for families and $2,000 for single individuals. 
In Delaware they were reported at the source if in excess of $1,000, 
irrespective of the recipient’s family status. 

Dividends amounting to $100 or more in a year were reported at the 
source under the Federal law and were therefore fully reported by per- 
sons filing returns, but not necessarily reported at all by families under 


1 For evidence on this point see my articles in the Review of Economic Statistics, November 1940 
(Vol. XXII, pp. 165-182) and February 1942 (Vol. XXIV, pp. 9-21). Further evidence of understate- 
ment (or omission) is found in the fact that the number reporting savings bank accounts far exceeded 
the number reporting the receipt of either interest or dividends, in every income class. Also, in the in- 
come classes above $2,500, where the coverage of the tax returns is complete, the proportion of families 
reporting dividends and the proportion reporting interest on tax returns, each exceeded the combined 
proportion of families reporting either dividends or interest on the CPS returns. Likewise the proportion 
of total income derived from dividends or interest reported on the tax returns each exceeded the propor- 
tion of dividends and interest together reported in the CPS, for every income class. 
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$2,500 or singie individuals under $1,000. Consequently a large amount 
of dividends is omitted, since it was received by persons in the lower in- 
come brackets who filed no Federal returns. 

In general the tax ratios may slightly exaggerate the importance of 
property income and minimize the importance of individual profits in 
all classes between $2,500 and $20,000. Below $1,000 one might suspect 
thai they exaggerated the importance of property incomes, but later on 
in this article evidence is given to show that that was not the case with 
respect to the Delaware ratios which have been used for family cash in- 
come in classes below $2,500. To these ratios the NRC estimates of im- 
puted rental from owned homes and home-produced food and fuel have 
been added, since these items were not included in the Delaware returns. 

Of course the ratios from the various tax tabulations do not agree ex- 
actly; in some cases there are wide variations. But these variations can 
usually be explained by differences in the legal definition of the kind of 
income under consideration, or of the income taken as the basis for tab- 
ulation. Or else they are caused by differences in the extent of coverage, 
or by the inclusion of separately reporting husbands and wives in their 
individual income classes and not in the class where the family belongs. 
Fortunately it has been possible in every calculation to eliminate the 
separately reporting husbands and wives, and in most cases to put the 
combined family in its proper class. If, after this adjustment was made 
there were two ratios that seemed equally reasonable in the light of the 
definitions and coverage, that one was taken which showed the lower 
proportion of property income below $2,500, and the higher proportion 
above; and the opposite choice was made for labor income. 

Besides the tables showing the ratios of all persons receiving income 
from the different sources, and the total amount received, there are 
official tables of “Patterns” of income, showing the number of persons 
wholly relying on income from each source. These “Patterns” are in 
many respects unsatisfactory; they all lump together families, single in- 
dividuals and separately reporting husbands and wives. The United 
States “Patterns” includes estates and trusts. The Wisconsin “Patterns” 
has excessively wide limits to its income classes. Both the United States 
and the Wisconsin “Patterns” are tabulated according to a different 
definition of income from that used in the other tables. The United 
States “Patterns” tabulates the sources in such a way as to conceal the 
nature of the fourth source of income and to conceal the number of those 
having five or more sources. Wisconsin conceals the nature of the third 
source of income and the number of those having four or more sources. 
None of the “Patterns” include any imputed income, except an infini- 
tesimal amount of imputed profits in Wisconsin. 
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Consequently the number of incomes derived wholly from labor, or 
profits, or property, or profits and property together, is exaggerated by 
these “Patterns.” A considerable amount of correction is possible, by 
using other tax tables and those in the Consumer Purchases Study, but 
the figures used in this study probably still overstate the number of 
single-source incomes, especially single-source property incomes. 

The estimate of income distribution to which these ratios were applied 
was based on the one I published in this JourNnat last December, but 
adjusted to the calendar year 1936. If the National Resources Com- 
mittee’s distribution were used as a base it would increase both the 
number of property incomes and the number of labor incomes in the 
classes below $1,000 and above $7,500, and would give a larger total 
amount of property income and a larger number of persons receiving 
only property income. As I stated in that article, I believe that both the 
NRC figure and my own, for the under $500 class, were too large. 

The definition of income used for classification is the same as that 
used by the National Resources Committee in its studies, with the in- 
clusion of imputed income for families as estimated by the NRC, but 
no allowance for imputed income of single persons. There is, therefore, 
a slight exaggeration of the importance of labor income in the lower 
classes. Capital gains are not included in income in this analysis. 

We may now proceed to a consideration of the detailed tables, and 
the more important conclusions to be drawn from them. 


LABOR (WAGES AND SALARIES) 


In the first place the commonest source of income is labor. That is not 
a startling discovery, but the distribution of labor income by classes is 
perhaps a little unexpected. The greatest proportion of labor income is 
found, not in the lowest class (under $500) but in the class $1,000— 
$1,500, where 90 per cent of the consumer units report it. The propor- 
tion declines with increasing income until 71.7 per cent is reached in the 
$10,000—-$15,000 class, but above $15,000 over 75 per cent report labor 
incomes. 

Taking the proportion of labor income to all income: that also is 
highest in the $1,000-$1,500 class and is higher in every class up to 
$3,500 than it is in the lowest class (under $500), where it is only 65.7 per 
cent. Even above $20,000 the proportion of labor income was 22 per 
cent. Except in the $1,000—$2,000 class, families were more likely than 
single persons to have labor incomes, and in every class single men were 
more likely than single women to have labor incomes. 

Considering now the proportion of consumer units wholly dependent 
on labor income, here again the lowest class does not show the highest 
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TABLE I 
PER CENT RECEIVING ANY INCOME FROM SPECIFIED SOURCES, 
UNITED STATES, 1936 
(Consumer units) 
‘ Any Any property 
Income class Labor Business Partnership Imputed profits or profits 
(cash) profits . . 
(min.) (min.) 
O- 500 78.2 7.6 4 15.6 17.8 34.3 
500— 1,000 88.2 8.0 .4 18.0 21.4 24.4 
1,000— 1,500 90.0 8.4 1.7 16.0 22.3 24.6 
1,500— 2,000 89.0 13.7 2.1 14.1 23.8 36.3 
2,000— 2,500 86.7 19.0 2.8 11.5 23.2 54.2 
2,500— 3,000 83.6 19.2 4.2 12.1 25.7 56.3 
3 ,000— 3,500 81.4 21.4 5.7 11.7 28.0 64.3 
3,500— 4,000 80.3 23.7 6.8 11.9 28.9 75.5 
4,000— 4,500 79.1 26.7 7.8 10.8 33.6 78.1 
4,500— 5,000 77.9 28.6 7.6 10.6 35.9 81.2 
5 ,000—10 ,000 75.0 34.5 10.5 10.7 46.2 89.2 
10 ,000—15 ,000 71.7 40.7 13.9 7.0 51.6 96.5 
15 ,000—20 ,000 75.7 44.6 14.7 7.0 56.5 96.9 
20,000 & over 75.3 50.8 18.3 6.9 58.5 99.8 
Fiduciary Rent Imputed 7 oy 
Income class Sates Interest (cash) aeons Dividends property 
(min.) 
oO 500 4 17.6 6.7 17.6 7.4 26.8 
500- 1,000 4 10.0 5.0 19.5 5.3 19.5 
1,000- 1,500 5 13.1 5.0 24.6 7.1 24.6 
1,500— 2,000 of 19.4 7.0 31.0 10.6 31.0 
2,000— 2,500 3.8 22.2 10.8 38.5 18.7 43.3 
2,500—- 3,000 1.8 27.2 13.9 47.6 20.4 52.0 
3,000- 3,500 2.1 33.3 17.7 47.7 25.0 52.9 
3,500— 4,000 2.6 34.2 21.9 46.8 31.9 61.4 
4,000— 4,500 3.6 35.2 23.9 55.0 38.0 66.1 
4,500— 5,000 4.2 36.5 24.5 55.5 42.1 63.6 
5 ,000-10 ,000 7.5 44.1 27.4 54.4 47.3 74.9 
10 ,000-15 ,000 11.1 56.3 28.5 48.9 71.0 83.6 
15 ,000-20 ,000 13.5 62.4 28.1 48.0 78.9 91.4 
20,000 & over 29.3 74.1 32.7 47.2 87.3 95.2 





proportion, but is surpassed by the classes between $500 and $1,500. 
The highest proportion is in the class $500—$1,000, if cash income alone 
is considered, and in the class $1,000-$1,500 if allowance is made for 
imputed income. In most classes families appear to be more likely than 
single individuals to depend wholly on labor income. Perhaps that is 
because young unmarried persons can save more easily than married 
ones, and aged persons who have acquired property during their prime 
are likely to be widowed. In every class single men are much more likely 
than single women to be dependent on labor only. 

If the ratios presented in Table I are applied to the distribution of in- 
come presented in my article in the December JouRNAL, the number of 
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consumer units receiving some labor income comes out as 35,433,000. 
This represents a considerably larger number of workers since working 
wives and children are not counted. The total of gainfully employed 
reported by the 1930 Census was 36,470,000 after deducting farm 
owners and operators and the unpaid members of their families. Plainly 
the number of hired workers has not been underestimated in this study. 
Similarly the total of wages and salaries obtained by using the ratios 
of income shown in Table III is $45,387,000,000, while the amount cal- 
culated for 1936 by the Department of Commerce is $42,125,000,000. 


TABLE II 


PER CENT RECEIVING ANY INCOME FROM SPECIFIED SOURCES, 
UNITED STATES, 1936 


























Families* Single personst 

Encome class Sis Profits Property —— Profits Property 
(min.) (min.) (min.) (min.) 

Oo 500 83.5 30.9 29.5 75.0 3.4 26.8 
500— 1,000 89.0 32.4 29.1 87.0 4.9 13.4 
1,000— 1,500 90.0 29.0 33.4 92.6 3.8 18.4 
1,500— 2,000 88.3 27.2 36.9 90.0 5.7 29.1 
2,000— 2,500 86.7 25.4 44.6 84.7 9.4 43.3 
2,500— 3,000 82.0 7.1 52.0 80.1 12.8 53.3 
3,000-— 3,500 78.8 29.3 52.9 74.7 16.2 64.5 
3,500— 4,000 76.3 30.5 52.9 63.7 17.6 64.5 
4,000— 4,500 75.0 35.6 62.8 61.1 18.8 75.3 
4,500— 5,000 77.4 38.2 62.8 7.8 19.1 75.3 
5 ,000—10 ,000 73.8 50.2 69.1 54.8 20.2 82.3 
10 ,000—15 ,000 73.6 51.6 82.0 55.1 18.5 92.6 
15 ,000—20 ,000 77.4 56.5 90.5 53.3 18.4 86.9 
20,000 & over 81.3 58.5 96.0 52.7 13.2 94.2 

Single ment Single woment 

Income class “— Profits Property an Profits Property 
(min.) (min.) (min.) (min.) 

Oo 500 87.8 4.2 13.8 64.7 2.7 34.4 
500— 1,000 91.6 5.6 8.6 82.2 4.1 18.5 
1,000— 1,500 93.2 4.5 12.6 87.6 3.1 25.5 
1,500— 2,000 92.8 6.0 20.7 80.6 5.0 42.5 
2,000-— 2,500 7.3 9.8 29.4 74.3 8.4 60.3 
2,500— 3,000 84.0 13.9 41.7 65.1 10.8 67.4 
3,000— 3,500 78.1 18.3 56.9 50.0 12.5 73.0 
3,500— 4,000 78.1 20.0 56.9 50.0 14.0 73.0 
4,000— 4,500 82.0 22.5 66.0 31.9 13.5 85.1 
4,500— 5,000 82.0 23.1 66.0 31.9 13.5 85.1 
5 ,000—10 ,000 69.5 25.5 80.0 20.9 13.7 84.9 
10 ,000—15 ,000 55.1 24.4 88.9 5.3 12.3 96.3 
15 ,000—20 ,000 80.0 25.6 80.0 27.3 11.5 92.1 
20,000 & over 53.8 20.5 92.3 17.0 8.0 95.7 





* Including imputed rent and profits. 
+ Excluding imputed income. 
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TABLE III 


PER CENT OF INCOME RECEIVED FROM SPECIFIED SOURCES, 
UNITED STATES, 1936 
(Consumer units) 




















Economic in- a — Partnership Profits ee All profits 
come class waned ne profits (imputed ) All profits or property 
(cash) 

O- 500 65.7 4.3 3 10.4 15.0 34.3 
500- 1,000 75.6 5.7 3 8.2 14.2 24.4 
1,000— 1,500 79.2 6.4 -4 5.4 12.2 20.8 
1,500— 2,000 78.0 8.4 8 3.9 13.1 22.0 
2,000— 2,500 75.0 10.0 1.5 3.7 14.2 25.0 
2,500— 3,000 72.0 10.6 2.2 2.6 15.4 28.0 
3 ,000- 3,500 68.7 12.3 2.8 2.1 17.2 31.3 
3,500— 4,000 64.4 13.9 3.5 2.0 19.4 35.6 
4,000-— 4,500 60.4 15.1 4.3 1.4 20.8 39.6 
4,500— 5,000 58.2 15.1 4.7 1.3 21.1 41.8 
5 ,000—10 ,000 47.5 14.9 6.2 9 22.0 52.5 
10 ,000-15 ,000 39.4 12.7 7.8 4 20.9 60.6 
15 ,000—20 ,000 34.3 11.1 8.4 3 19.8 65.7 
20,000 & over 22.0 5.6 9.5 1 15.2 78.0 
Economic in- ys Interest ment , Rent Dividends All 

come class (cash) (imputed) property 

property 

0O- 500 4 6.1 4.6 5.6 2.6 19.3 
500— 1,000 4 3.3 2.3 4.0 1.3 10.2 
1,000— 1,500 5 1.4 1.4 4.2 is 8.6 
1,500— 2,000 a 1.3 1.5 4.2 1.2 8.9 
2,000-— 2,500 1.0 1.8 2.0 4.2 1.8 10.8 
2,500- 3,000 1.8 1.8 2.2 4.2 2.6 12.6 
3 ,000- 3,500 1.8 2.0 3.7 4.2 3.4 14.1 
3,500- 4,000 2.6 2.5 2.7 4.1 4.3 16.0 
4,000— 4,500 3.6 2.9 2.6 4.0 5.7 18.8 
4,500— 5,000 4.2 3.2 2.6 4.0 6.7 20.7 
5 ,000-10 ,000 5.4 9.2 2.5 3.5 9.8 30.5 
10 ,000-15 ,000 8.0 11.3 1.5 3.1 15.8 39.7 
15 ,000—20 ,000 9.1 12.3 1.5 2.9 20.1 45.9 
20,000 & over 11.4 11.7 1.8 2.5 35.5 62.8 





The total receiving no cash income except wages or salaries was 
about 25,000,000 according to these calculations; if an allowance were 
made for rent of owned homes and imputed profits received by families 
the total might be as low as 11,000,000. For comparison the total num- 
ber of consumer units in the United States in 1936 is estimated to have 
been 41,234,000. But, as will be shown later, these persons with no 
income but wages and salaries cannot be regarded as a propertyless 
proletariat. Many of them had high incomes, and most of the families 
with low incomes owned property, such as automobiles, radios and 
other durable consumers’ goods, and insurance policies, and bank ac- 
counts. Moreover many of the single persons owned their homes al- 
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though it has been impossible to include their imputed rental in these 
tables. 


PROFITS, OR ENTREPRENEURIAL INCOME 


Business and partnership profits include the income of farmers, pro- 
fessional people, and unincorporated merchants and manufacturers. 
They stand between wages and salaries, which represent payment for 
labor, and the various kinds of property income which are payments for 
the use of capital. They are a joint product of labor and capital; the per- 
sons who receive them are both laborers and capitalists. Individual 


TABLE IV 


PER CENT OF INCOME RECEIVED FROM SPECIFIED SOURCES, 
UNITED STATES, 1936 


























Families* Single personst 
Income class 
Labor Profits Property Labor Profits Property 
oO 6500 61.6 23.1 15.3 75.0 3.4 21.6 
500- 1,000 71.3 19.3 9.4 85.3 4.3 10.4 
1,000— 1,500 76.8 15.0 8.2 90.4 3.1 6.5 
1,500— 2,000 77.2 14.6 8.2 84.5 4.5 11.0 
2,000— 2,500 75.0 15.4 9.6 73.8 7.0 19.4 
2,500-— 3,000 73.0 15.8 11.2 62.3 9.5 28.2 
3,000— 3,500 70.0 BM 12.3 57.3 11.9 35.7 
3,500— 4,000 66.0 20.1 13.9 47.1 12.9 41.1 
4,000- 4,500 62.4 21.6 16.0 40.5 13.6 45.9 
4,500— 5,000 60.6 22.0 17.4 38.0 14.0 48.0 
5 ,000-10 ,000 50.2 23.2 26.6 32.7 13.3 60.2 
10,000-15 ,000 42.6 22.4 35.0 27.0 11.3 70.6 
15 ,000-20 ,000 37.8 21.3 40.9 23.7 10.9 74.2 
20 ,000 & over 24.6 16.6 58.8 19.5 8.0 81.7 
Single ment Single woment 

Income class 
Labor Profits Property Labor Profits Property 
Oo 500 86.4 4.2 9.4 64.7 2.7 32.3 
500—- 1,000 90.0 5.3 4.7 80.1 3.4 16.4 
1,000— 1,500 92.5 3.7 3.8 87.6 1.8 10.6 
1,500— 2,000 89.2 5.0 5.8 74.7 3.0 22.3 
2,000— 2,500 82.0 7.9 10.1 55.4 4.9 39.7 
2,500- 3,000 73.1 11.3 15.6 41.7 6.2 52.2 
3,000—- 3,500 64.7 15.0 20.3 31.0 6.5 62.5 
3,500— 4,000 59.6 16.4 24.0 24.7 7.4 67.9 
4,000— 4,500 54.3 18.0 27.7 20.6 7.2 72.2 
4,500— 5,000 52.4 18.8 28.8 17.3 7.2 75.5 
5,000-—10 ,000 40.4 19.8 39.8 9.3 6.1 84.6 
10 ,000—15 ,000 30.9 18.5 50.6 4.9 4.6 90.5 
15 ,000-20 ,000 26.8 19.3 53.9 3.7 4.2 92.1 
20,000 & over 17.3 14.2 68.5 2.5 3.1 94.4 





* Including imputed rent and profits. 
+ Excluding imputed income. 
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TABLE V 


PER CENT OF CONSUMER UNITS RECEIVING INCOME ONLY FROM 
SPECIFIED SOURCES, UNITED STATES, 1936 














Labor Profits Property 

Income class 
min.* min.¢ max.* min.¢ max.t min.* min.¢ max.t 
oO 6500 28.5 61.7 65.7 3.7 3.7 0.2 15.8 15.8 
500- 1,000 41.4 73.8 75.6 4.9 5.0 —_ 4.2 5.8 
1,000- 1,500 34.9 66.1 69.1 4.5 5.2 _— 1.7 3.4 
1,500— 2,000 21.9 57.1 63.1 5.9 6.5 1.5 2.9 
2,000—- 2,500 2.9 42.0 45.8 6.7 7.1 — 2.0 3.0 
2,500- 3,000 _ 42.8 43.7 6.3 8.4 — 2.0 4.1 
3 ,000- 3,500 _— 29.9 35.7 5.3 8.5 —- 3.0 4.4 
3,500— 4,000 _— 24.5 24.5 5.3 9.1 _ 3.0 6.3 
4,000- 4,500 _ 7.3 21.9 5.7 9.4 _ 4.1 6.9 
4,500- 5,000 _ 17.1 17.8 5.7 9.2 — 4.1 7.8 
5,000-10 ,000 _ 5.4 10.8 3.0 8.1 _- 5.6 10.4 
10 ,000-15 ,000 _ _— 3.5 -7 5.5 1.6 8.5 16.0 
15 ,000-20 ,000 _ —— 3.1 _ 4.2 1.5 8.4 15.4 
20,000 & over _ — 0.2 9 2.3 4.6 11.5 16.7 





* Allowing for imputed rent and profits. The minimum for prefits, on this basis, is zero in all 
oe income only. 
business cash profits and partnership profits, as shown by Table I, 
were proportionally most numerous in the classes above $3,500. But 
items of imputed profits (food, fuel and merchandise from stock) were 
commonest in the classes below $3,500. The ratio of all profits to total 
income, as reported on tax returns, varied slightly among the income 
classes, and was apparently highest between $3,000 and $15,000. Ac- 
cording to the NRC, however, independent professional and business 
families (including farm families) were most frequent in the class below 
$500 and the classes over $4,000. Families were more likely than single 
individuals to receive profits, perhaps on account of the high propor- 
tion of married men among farmers. Single men also surpassed single 
women in the matter of profits, in every income class. 

Only a small proportion of consumer units relied wholly on profits. 
That proportion was greatest in the classes between $1,500 and $4,000, 
and consisted mainly of families, although according to the Delaware 
returns a fairly high proportion of single men in the classes between 
$2,500 and $15,000 may have had no other kind of cash income. 


PROPERTY INCOME 


Property income is classified as dividends, interest, rent (including a 
small amount of royalties), and fiduciary income (i.e., income from es- 
tates and trusts). These will be taken up in that order, after which prop- 
erty owners of all sorts will be combined. 
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Dividends.—The total number of consumer units receiving dividends 
in 1936 was apparently about 4,731,000. Of course there were many 
other stockholders who received no dividends and in many families 
there was more than one stockholder. The total number of voters re- 
ceiving dividends was over 8,200,000, allowing two to a family and as- 
suming that each individual taxpayer is a voter. Dividend receivers 
were least numerous, proportionately, in the $500-$1,000 class, where 
only one person in nineteen received dividends; most numerous in the 
wealthiest class, where over 87 per cent received them. About half of 
the dividend receivers had total incomes from all sources amounting 
to less than $2,000, and they received about 10 per cent of the dividends. 
Of all kinds of income studied dividends were the least evenly distrib- 
uted. Half of them were received by persons with incomes over $16,000. 
In the classes under $15,000 single women were much more likely to re- 
ceive dividends than were single men, and families were less likely. 

Interest.—A surprisingly large number of persons in the lowest in- 
come brackets, and of single women in all income brackets, received in- 
terest. Half of the interest receivers had total incomes under $1,615, and 
half of the interest income was received by consumer units with total 
incomes under $3,280. But if the survey made by the Consumer Pur- 
chases Study can be relied on there were apparently also a large number 
of families who had savings accounts, presumably paying interest, but 
who did not admit receiving any interest, either to the CPS investiga- 
tors or on their tax returns. However, without counting these owners 
of non-interest paying savings bank accounts, or attempting to explain 
them, there were 4,434,000 families and 2,086,000 single individuals 
who received interest, or over 11,000,000 voters. Including the savings 
accounts there were 6,282,000 families, making the number of voters 
14,650,000. The total amount received appears to have been $2,463,- 
000,000. 

Rents.—Rent is a form of income that is nearly as evenly distributed as 
wages and salaries, if the imputed rent received by home owners is in- 
cluded. There were in 1930, according to the Census, 14,002,000 home 
owners; in 1940 there were 15.196,000. Only 11,415,000 home-owning 
families can be located by the methods used in the present study, using 
the ratios of home ownership in each income class reported by the Con- 
sumer Purchases Survey. Half of them had incomes under $1,570. The 
tax reports enable us to locate 3,282,000 consumer units receiving cash 
rent, most of whom were probably also home owners, and all of whom 
have been treated as home owners in this analysis. But since some of 
them were not, the total number receiving rent, cash and imputed to- 
gether, is underestimated. Cash rent was more evenly distributed than 
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dividends, but not as evenly as interest. Half of the consumer units 
receiving it had incomes under $1,760, and half of the cash rent re- 
ceived was received by consumer units with incomes under $2,250. 

Fiduciary (Estates and Trusts).—Fiduciary income was unimportant 
in amount and received by only a very small proportion of persons in 
each income class except the highest. In this study it has been necessary 
to lump fiduciary income together with some miscellaneous sources, 
such as annuities and other items partaking of the nature of interest 
but not reported as such on the tax returns. 

All Property Income Combined.— When property incomes are lumped 
together the number in the 0-$500 class, and the amount, are surpris- 
ingly large. Checking the Delaware tax data against the CPS figures for 
families we find that the tax returns show a larger amount of interest 
and dividends than the CP§; in fact, they show a larger amount of either 
interest or dividends than the sum of the two as reported by the CPS. 
But there is reason to believe that the CPS proportions were below the 
truth.? 

In the case of rent, other than imputed rent from owned homes, the 
CPS ratio of families receiving it was higher than the tax return ratio 
for classes between $500 and $1,500, although the ratio of rent income 
to total income was lower. 

The CPS also reported an item of income other than earnings or rent 
or dividends or interest or imputed rent. This consisted mainly of an- 
nuities and pensions, which are of course property income, although of 
a non-transferable kind, since they are not paid for current services. 
Adding together all the property income reported by the CPS the total 
is a larger proportion of total income than that derived from the tax re- 
turns in each class under $2,000; and like the proportions derived from 
the tax returns they become larger as the income declines from $1,250 
toward zero. 

A similar increase is shown by the CPS study in the proportion of 
families receiving cash rent or interest and dividends or imputed rent, 
as the total income declines below $750. A little reflection shows why 
this must be so: a large proportion of the people in the lowest income 
class are there because they are incapable of earning anything—be- 
cause of age or physical disability. Such small receipts as they may en- 
joy from property they have saved or inherited constitute a large pro- 
portion of their small incomes. In many cases they are able to spend 
more than their income by gradually drawing down their capital. 

The CPS figures for imputed income from home ownership are in- 
cluded in property income in these tabulations. All of the tax return 


2 See footnote 1. 
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ratios used are based on total income, excluding capital gains, and 
where necessary further adjusted to allow for imputed income, which 
was taken from the NRC reports. 


THE PROLETARIAT AND THE PROPERTIED CLASS 


The simple picture of a class society constructed by Karl Marx and 
accepted by unobserving persons in this country is not supported by the 
income tax statistics. We do not have a large propertyless proletariat, 
living in poverty and despair, ruled by a small class of capitalists, living 
in luxury and idleness. Every income class received over one-fifth of its 
income from sources other than wages; no income class received less 
than one-fifth of its income from the sale of its labor to employers; and 
no income class received less than 37 per cent of its income from labor 
of some sort—either working for others or actively conducting a busi- 
ness enterprise. The number of idle rich is small. Of the 231,000 con- 
sumer units with incomes over $10,000 not over 51,000 lived entirely on 
income from property and at least 16,000 or 17,000 of these were single 
women (including widows). 

Of the 5,915,000 consumer units (if that is the right number) with 
incomes under $500 on the other hand, 935,000 were entirely dependent 
on property for their income, except if they were able to raise part of 
their own food and fuel; and of these some 370,000 were single women. 
One-half of the persons who received any income from property had 
total incomes under $1,110. The median income of all consumer units 
receiving wages or salaries was $1,220, while the median income from 
all sources was $1,214. Another important fact, not shown by the sta- 
tistics but known to all of us, is that most of the male capitalists had 
earned their capital by their own efforts, and many of the propertyless 
laborers will earn enough capital, before they become too old to work, 
to maintain them in their old age. As a corollary of this we find a large 
proportion of persons in the lowest 10 per cent of the population—the 
“submerged tenth”—who live entirely or mainly on the fruits of their 
savings. These, and the single women of all income classes, constituted 
almost the whole of our non-working “leisure class” of capitalists. 

The corporations of the country were not owned by sixty families or 
200 large stockholders. It required at least 126,000 of the wealthiest con- 
sumer units to muster one-half of the shares. Half of the stockholding 
units had incomes under $2,030. The unincorporated businesses of the 
country were even more widely owned; half of the 9,800,000 con- 
sumer units receiving business or partnership profits (omitting im- 
puted profits) had incomes under $1,295. 

The homes of the country were owned by about 15,000,000 consumer 
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units, not counting the number indirectly owned through corporations 
and associations. There were at least, 12,700,000 consumer units who 
owned bonds, mortgages or bank accounts, and half of them had in- 
comes under $1,780. 

Some other important forms of capital were not referred to in the tax 
returns. Some information is available from the Consumer Purchases 
PROPORTION OF FAMILIES OWNING PROPERTY 
BY INCOME CLASSES 
(United States-1936) 

£ OF FauiL ies 
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INCOME CLASS IN DOLLARS 


Study concerning the ownership of automobiles and insurance policies, 
and is shown in the attached chart. Clothing and furniture are of course 
very widely owned but may be regarded as consumer goods rather than 
capital. But the ownership of a most important type of capital—the 
tools and machines, shops, stores, inventories and farm equipment nec- 
essary to earn business or partnership profits—can only be surmised 
from the distribution of profits. 

On the whole the best measure of the distribution of ownership of all 
forms of productive capital together appears to be the combined income 
from property and profits. According to this measure one-half of all 
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the productive capital of the country was owned by consumer units 
with incomes under $3,065, and half of the owners had incomes under 
$1,575. 

We must conclude that the vast bulk of our people are both capital- 
ists and laborers during the prime of their lives; they begin their careers 
without much capital except their bodies and their brains, and they 
wind up with very little earning power in them, but with enough capi- 
tal to live on. This fact largely explains the political stability we have 
had in the past and is the best hope for political stability in the future. 
Certain other factors in our political stability cannot be measured by 
the available material, but should be covered by future investigations. 
They include the age and race factors—to what extent the propertyless 
persons are young, or Negro, or recently arrived foreigners—and the 
fluidity factor, by which is meant the shift of individuals from income 
class to income class and the length of life of large fortunes. My own 
belief is that, if those factors were studied it would be found that in the 
matters of equality of opportunity and wide distribution of wealth, 
the United States compares favorably with any nation now existing, or 
any important civilized nation of the past. 


Nore: Several tables showing in greater detail the facts mentioned in this article had to be omitted 
for lack of space. A limited number of copies will be sent by the author to those requesting them. 








COMPARISON OF DEMOGRAPHIC RATES ASSUMED BY THE 
NATIONAL RESOURCES COMMITTEE WITH 
ACTUAL EXPERIENCE 


By Rosert J. Myers 


N ITS POPULATION projections, the National Resources Committee 
I assumed certain death rates and birth rates for future periods' up 
to 1980. The assumed future rates had as a base point the actual rates 
for the period 1930-34. Now that five years have elapsed and actual 
census material is available, it is worth while to make a comparison of 
the actual rates with those estimated. It is hardly necessary to point 
out that forecasts of this type are especially dubious for the immediate 
future because of the effects of the war. 

In order to facilitate computations, the type of death rate used by the 
NRC differed somewhat from the usual age-specific rates of life tables. 
For a given quinquennial age group the probability of dying within 
five years was set down for various future quinquennial periods. In 
order to project births, there was developed a special death rate to give 
the probability of uniformly-spaced births in a five-year period dying 
before the end of that time. Three sets of rates, low, medium, and high, 
were developed for native whites, foreign-born whites, and Negroes 
with, of course, separate rates for males and females. 

Perhaps the best actual mortality data for current purposes is the 
set of 1940 population life tables? prepared by the Statistical Bureau of 
the Metropolitan Life Insurance Company,’ from which death rates 
of the same form as used by the NRC may be developed. In order to 
make a consistent comparison, it is necessary to make certain adjust- 
ments in the NRC figures. First, NRC figures applicable to 1940 may be 
obtained by averaging the rates for any particular age group for the two 
quinquennial periods, 1935-39 and 1940-44. Second, a combination of 
the NRC mortality rates for native whites and foreign-born whites 
must be made since the Metropolitan life tables which are being used 
for comparison are only on the basis of total whites. This was accom- 
plished by weighting native white and foreign-born white death rates 


1 The actual figures appear in Tables 1B and 1C of Population Statistics, Vol. 1, National Data, 
National Resources Committee, October 1937. Details of the methodology are contained in P. K. 
Whelpton, “An Empirical Method of Calculating Future Population,” this Joonnat, September 1936, 
pp. 457-473. 

2 It will be noted that the actual mortality experience is that for only a single year, 1940. However, 
the mortality of that year differed only slightly from that of surrounding years; the crude death rate for 
1940 was 10.7 per thousand as contrasted with an average of 10.6 for the three years 1939-41. 

3 Presented in abridged form in Statistical Bulletin, Metropolitan Life Insurance Company, Decem- 
ber 1941, p. 8. The complete table was made available through the courtesy of Dr. A. J. Lotka. 
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by the respective proportions of these groups relative to the total white 
population as indicated by the 1940 Census.‘ 

In Table I there are set down separately by sex and race the actual 
1940 death rates on a comparable basis with the NRC figures. Table 
IIa and IIb compare these rates with those of the NRC modified as de- 
scribed previously to apply to1940. At the younger ages the actual rates 


TABLE I 


ACTUAL 1940 FIVE-YEAR TIME INTERVAL DEATH RATES 
FOR QUINQUENNIAL AGE GROUPS* 














White persons Colored persons 
Age group 
Males Females Males Females 

Birth to 0-4f 48.5 38.1 75.9 59.6 
O- 4 10.1 8.3 16.4 13.8 
5- 9 5.1 3.7 7.0 5.9 
10-14 6.7 4.7 13.1 13.9 
15-19 9.8 6.9 25.0 25.0 
20-24 12.0 8.9 34.9 1.1 
25-29 13.8 10.8 41.8 36.5 
30-34 17.5 13.7 49.9 43.8 
35-39 24.6 18.1 63 .6 57.7 
40-44 36.7 25.5 85.3 76.4 
45-49 55.4 36.9 117.9 103.9 
50-54 82.5 54.1 150.7 136.7 
55-59 118.7 80.7 171.2 155.6 
6C-64 169.5 124.9 209.9 183.0 
65-69 243.1 193.8 265.6 230.6 
70-74 345.1 289.5 326.5 288.4 
75-79 476.5 426.0 412.6 349.0 
80-84 623.9 600.0 546.2 420.4 
85-89 761.5 774.9 699.7 555.7 
90-94 864.9 882.6 832.1 732.6 

















* Derived from U. S. 1940 Life Tables prepared by the Statistical Bureau of the Metropolitan Life 


Insurance Company. 
+ This represents the number of deaths within a five-year time interval among 1,000 infants born 


uniformly throughout that period. 


for both white males and white females are appreciably lower than even 
the “low” estimate; thus, for white males the differential is 20 per cent 
at the youngest childhood ages and prevails to some extent to age 35, 
whereas for white females the differential is even larger and persists 
until the older ages. For white males the actual rates beyond age 35 are 
about 5 per cent in excess of the “low” estimates, but for white females 
the corresponding estimates are about 5 per cent too high for ages 35 


4 Based on the release dated April 10, 1942, Age Composition of the Population, Series P-10, No. 6, 
Bureau of the Census. 
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through 55. Up to age 50 the “high” estimates exceed the actual rates 
for white males, but beyond there is a slight differential in the opposite 
direction. The “high” rates for white females are above the actual death 
rates at all except the extreme ages. 


CHART I 


COMPARISON OF ACTUAL 1940 FIVE-YEAR DEATH RATES AND 
THOSE OF NRC—WHITE MALES 


DEATH RATE PER 1,000 DEATH RATE PER 000 
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Charts I and II dealing with white males and white females respec- 
tively make a graphical comparison of the actual 1940 death rates with 
those indicated for 1940 by the NRC and also with those for the “ulti- 
mate” condition in 1975-79. The difference between the actual rates 
and the 1940 estimates at the younger ages is quite striking, but be- 
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yond age 35 for males and 55 for females there is little difference. At the 
childhood ages for both males and females and at the younger adult 
ages for females the actual 1940 rates fall well within and, in some in- 
stances, below the ultimate range which, it might be pointed out was 


CHART II 
COMPARISON OF ACTUAL 1940 FIVE-YEAR DEATH RATES AND 
THOSE OF NRC—WHITE FEMALES 
DEATH RATE PER 1.000 
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extremely narrow at the younger ages, although wisely wide with ad- 
vancing age. 

Actual 1940 colored death rates have fallen fairly closely within the 
range developed by the low and high estimate as may be seen from 
Table IIb. The greater amount of fluctuations present as contrasted 
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with the fairly smooth trend for white persons in Table Ila arises from 
the well-known difficulties in obtaining accurate reporting for this pop- 
ulation group. 

TABLE II 


ACTUAL 1940 FIVE-YEAR TIME INTERVAL DEATH RATES 
AS PERCENTAGE OF ASSUMED NATIONAL 
RESOURCES COMMITTEE RATES 





























a. White persons* b. Colored personst 
Males Females Males Females 
Age group 
as per cent of as per cent of 
“Low” “High” “Low” “High” “Low” “High” “Low” “High” 

Birth to 0-4} 94 89 95 90 111 102 107 99 
0- 4 80 77 75 72 96 88 89 80 
5- 9 80 77 80 77 90 80 95 84 
10-14 7 81 87 81 109 91 122 98 
15-19 94 84 78 70 107 86 102 81 
20-24 97 87 79 70 108 89 108 87 
25-29 93 85 79 70 106 91 104 87 
30-34 94 84 85 75 98 84 104 87 
35-39 101 90 93 83 104 90 108 93 
40-44 101 93 93 85 103 93 105 94 
45-49 107 99 95 89 114 105 111 102 
50-54 109 103 96 91 117 110 105 98 
55-59 105 102 92 89 105 101 94 90 
60-64 105 101 96 92 103 100 91 87 
65-69 104 101 99 96 102 99 95 92 
70-74 105 101 99 95 103 99 102 98 
75-79 103 101 100 98 94 92 95 94 
80-84 103 102 102 102 95 95 97 97 
85-89 104 104 107 107 97 97 106 106 
90-94 104 104 109 109 104 104 109 109 























* Averages of 1935-39 and 1940-44 rates for native and foreign-born whites weighted by respective 
proportions in 1940 census for given age-sex group (for quinquennial age groups beyond age 75, where 
census data are not subdivided by age groups, proportion for those 75 and over is used). 

t Averages of 1935-39 and 1940-44 rates for Negroes. 

t See footnote ¢, Table I. 


One other comparison that may be made is to determine the calendar 
year in which the assumed “medium” death rate is the same as the ac- 
tual 1940 one (Table III). This will indicate the length of time that has 
been “gained” in regard to “expected” mortality improvement. For 
white males up to age 40 there has been a gain of about 5-10 years with 
the exception of ages 5-15 where we are now already lower than the 
ultimate figure. For white females there is even more of a gain with such 
periods being 20-40 years up to age 30 but less thereafter up to age 60. 
For colored persons there are relatively few gains, except that at the 
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older ages actual 1940 death rates are in many instances lower than the 
estimated ones; how much validity may be attached thereto is open 
to question. 

TABLE III 


CALENDAR YEAR IN WHICH ASSUMED NATIONAL 
RESOURCES COMMITTEE *‘MEDIUM” DEATH 
RATE EQUALS ACTUAL 1940 RATE* 














White persons Colored persons 
Age group 
Male Female Male Female 
Birth to 0-4¢ 1944 1942 t t 
0O- 4 1951 Ultimate§ 1942 1947 
5- 9 Ultimate§ 1972 1970 1943 
10-14 Ultimate § 1962 t t 
15-19 1947 Ultimate § t 1942 
20-24 1944 1962 t t 
25-29 1944 1967 t t 
30-34 1944 1955 1942 1941 
35-39 1941 1947 1941 t 
40-44 t 1947 1941 t 
45-49 t 1945 t t 
50-54 t 1941 t t 
55-59 t 1941 t q 
60-64 t t 4 " 

















* Rates for quinquennial calendar periods assumed to apply to middle year, with linear interpola- 
tion when actual rate falls between assumed ones. Native white rates used for white persons and Negro 
rates for colored persons. 

t See footnote t, Table I. 

t Actual 1940 rate higher than assumed 1940 “medium” rate derived as described in footnote * of 
Table IIa and IIb. 

§ Actual 1940 rate lower than assumed ultimate (1975-79) “medium” rate. 

| At most of the ages beyond 55 actual colored death rates are lower than the NRC “medium” 
estimate for 1940 (see Table II). 


The derivation of the actual five-year time interval birth rates was, 
in general, performed exactly according to the definition which relates 
births in the given quinquennium (1935-39) to women surviving at the 
end of the period according to what the age of the mother would have 
been at that time (regardless of whether or not she actually was alive 
then). In order to compute such rates, there was first obtained the dis- 
tribution by single years of age of mother and by race (white and col- 
ored) for reported births in each of the years 1935-39.5 

The births for each of the five years were then collected on a “step” 
basis so that there was obtained the number of births in the quinquen- 
nium to women at each given single age in 1939. For example, the figure 

5’ From various issues of Vital Statistics of the United States, Bureau of the Census. A slight ap- 
proximation had to be introduced since single age data are not available for Massachusetts and New 
Hampshire, but rather only quinquennial values; it was assumed that the distribution within each 


quinquennial age group was the same for these two states as for the remainder of the country, a pro- 
cedure which can produce only a negligible error. 
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for women age 20 in 1939 included births to women age 20 in 1939, age 
19 in 1938, etc. Since the distribution of the total female population is 
known only as of the census data, April 1940, and since the births were 
related in effect to ages of women as of the middle of 1939, a further ad- 
justment was necessary. Linear interpolation between the births at 
single years of ages was performed to make this ? year adjustment. 
In other words, it was assumed that ? of the births to women aged z 
in 1939 applied to women aged x+1 as of the census data in 1940, 
while the remaining } applied to women aged z then. Following this 
subdivision, quinquennial totals were obtained and divided by the 1940 
population figures to produce the desired birth rates on a basis unad- 
justed for under-reporting of births. Such adjustment was then made 
on the basis used by Whelpton;! namely, an under-reporting factor of 
96 per cent for white births and 86 per cent for colored births. If re- 
porting accuracy has improved, the finally resulting “actual” birth 
rates are overstated relative to the NRC assumed rates, but as yet there 
seems to be no specific evidence on this point. 

Table IV shows the actual 1935-39 rates for each quinquennial age 
group from 10-14 to 55-59, while Chart III deals with the range 15-19 
to 45-49 since the NRC did not present figures for the youngest quin- 
quennial age group nor for the two oldest ones, probably because of 
their small magnitudes. In addition, these actual rates are shown as 
percentages of the low and high NRC assumptions, with their rates for 
native and foreign-born whites being blended as was done for death 
rates and likewise the Negro rates being assumed to apply to colored 
persons. The total of the actual rates for all ages is indicative of the 
gross reproduction rate, although not an accurate expression thereof. 
For white females the actual rates tend to be higher than the low assump- 
tions except for the youngest and oldest age groups but, with only one 
exception are lower than the high assumptions. Thus at age 25-29 
the actual rate is almost 20 per cent in excess of the low assumption and 
slightly greater than the high assumption. In the aggregate, the actual 
birth rates are about 10 per cent greater than the low assumptions and 
slightly lower than the high assumptions. It will be noted that in Chart 
ITI there is a considerable margin between the actual 1935-39 rates and 
the low assumption for 1975-79 (the high assumption for that period 
being the same as the high assumption for 1935-39). 

The actual birth rates for colored females, when contrasted with the 
NRC assumptions in Table IV, do not show the same smooth relative 
trends as were present for white females. As with mortality rates, the 
greater fluctuation undoubtedly arises from inaccuracies and incom- 
pleteness of reporting. In the aggregate, the same trends are shown as 
for white females; namely, an excess of about 10 per cent over the low 
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assumptions and a slight deficiency as compared with the high assump- 
tions. 

In summary, it may be said that our actual mortality rates have im- 
proved appreciably more than the National Resources Committee esti- 
mated for the younger ages, whereas at the older ages there appears to 
be some indication of less improvement, especially for white males. In 


TABLE IV 
ACTUAL 1935-39 FIVE-YEAR TIME INTERVAL BIRTH RATES FOR 
QUINQUENNIAL AGE GROUPS* COMPARED WITH ASSUMED 
NATIONAL RESOURCES COMMITTEE RATES} 

















White females Colored females 
Age group Actual As per cent! As per cent Actual As percent | As per cent 
rate of “Low” | of “High” rate of “Low” of “High” 
10-14 a t t 1.6 t t 
15-19 55.1 91 85 201.3 129 115 
20-24 430.7 106 99 716.3 101 91 
25-29 616.7 117 103 675.7 121 106 
30-34 496.8 110 97 492.4 118 105 
35-39 312.6 104 90 304.8 87 7 
40-44 163.9 100 7 209.3 118 101 
45-49 42.6 97 84 55.5 97 82 
50-54 3.1 t t 8.3 t t 
55-59 ok t t .6 t t 
Total 2121.7 109 97 2665.8 110 97 




















* Derived from recorded birth statistics for 1935-39 and the 1940 census as described in text. 

t+ Native and foreign-born white rates weighted by respective proportions in 1940 census to give 
totai white rates. Negro rates assumed to apply to total colored persons. 

t National Resources Committee did not present figure’ for this age group. 


any future projections it would seem advisable to introduce an even 
more drastic reduction in mortality rates at ages up to 35 (disregarding 
the immediate and near-future effects of the war, particularly upon 
males).In this connection it may be noted that the 1941 Hypothetical 
Life Table,* which indicates the plausible long-range future level of 
mortality and was developed from a corresponding table prepared in 
1933, made changes in mortality rates only at ages under 50. 

Our actual birth rates have been slightly higher than the medium 
assumptions of the National Resources Committee but below their high 
assumptions. As for the main child-producing period, 20-34, the actual 
birth rates have been about as large as the high assumptions, but at the 
older ages this tendency was not maintained. A more detailed analysis 
by child-order would show that this trend arises from the decline in the 
birth rate for third and higher children. 

The drastic increase in the birth rate since 1939 would, of course, 


§ Statistical Bulletin, Metropolitan Life Insurance Company, November 1941, p. 3. 
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raise the actual rates considerably above the Nationa! Resources Com- 
mittee assumptions which were based on the birth rates in the trough 
of the depression, 1930-34. Most authorities are prone to believe that 


CHART ITI 


COMPARISON OF ACTUAL 1935-39 FIVE-YEAR BIRTH RATES AND 
THOSE OF NRC—WHITE FEMALES 





BIRTH RATE PER 1,000 
700 
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the present increase is purely a temporary phenomenon and that after 
the war age-specific birth rates will continue to decline as is provided in 
the NRC medium and high assumptions. It is not at all impossible that 
the present fertility state of the nation is merely a forerunner of what 
might be expected after the war when economic reconstruction along 
the lines of the Atlantic Charter will produce social conditions favor- 
able to entirely higher fertility trends. 








CHOICE OF THE DEPENDENT VARIABLE 
IN REGRESSION ANALYSIS 


By FrepErRIcK V. WauGER 
Food Distribution Administration 


N THE SIMPLEST PROBLEM of two variables it is always possible to cal- 
I culate two linear regressions: one with X; as the dependent variable, 
and one with X2 as the dependent variable. Unless there is perfect corre- 
lation between the observed values of X; and X¢2 these two regression 
equations will be different. If we plot both on a diagram they will 
cross each other. How do we know, in any particular case, which re- 
gression equation to use? 

In a multiple regression problem with n variables we can always cal- 
culate n different regression equations treating each variable, in turn, 
as dependent and the other (n— 1) variables as independent. How do we 
know, in multiple regression, which of these n equations to use? 

There appears to be a good deal of confusion about this rather simple 
matter. A common rule of thumb procedure—particularly among econ- 
omists—is to select as the dependent variable the one which is taken to 
represent the “result” or “effect,” while the independent variable, or 
variables, are taken to represent “causes.” Those who follow this pro- 
cedure often argue that only one of the n regressions has any real mean- 
ing or any use. 

This procedure finds support in some of the best and most recent text- 
books. To illustrate this, we shall make several references to the recent- 
ly revised edition of Dr. Ezekiel’s excellent book.! Similar references 
might be found in other textbooks.’ 

Dr. Ezekiel* writes, 

It is always necessary to think over the problem carefully, and decide 
which variable we are going to regard as the independent or causal variable 
and which one as the dependent, or resultant. Thus, if we were relating 
variations in tobacco yields to applications of fertilizer, obviously the dif- 


ferences in fertilizer would be the cause and differences in yield the result, so 
we would sort our records according to the differences in fertilizer. 


The last clause might just as well have been written, “so we must use 
yields as the dependent variable.” Dr. Ezekiel states here that we should 
always choose the dependent variable from considerations of cause and 


1 Mordecai Ezekiel, Methods of Correlation Analysis, 2nd edition, John Wiley & Sons, New York, 
1941. 

2 For example, Albert E. Waugh, Elements of Statistical Method, McGraw-Hill Book Company, 
New York, 1938. See headings, “Nature of Multiple Relationships” and “Dependent and Independent 
Variables,” on pps. 318-319. 

3 Ezekiel, op. cit., pp. 50-51. 
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effect, but in the following paragraph he indicates that this procedure 
is not always possible. Thus, he writes:4 


It is frequently impossible to say which variable is the cause and which 
is the effect. All that can be definitely established is that the two vary 
together. Yet one may wish to regard one variable as the one whose values 
are given or known. It is then called the independent variable. ... The 
second variable will then be regarded as the one whose values are to be 
related to, or estimated from, the values of the known variable. It is then 
called the dependent variable since it is treated as depending upon the given 
values of the independent variable. 


These two quotations appear to summarize Dr. Ezekiel’s views on 
this problem. He seems to advise us to follow the cause and effect prin- 
ciple whenever we know which is which. When we don’t know which is 
cause and which is effect, or when both variables are results of a com- 
mon cause, we are advised to use another criterion in choosing the de- 
pendent variable. Not only does Dr. Ezekiel advise us to follow this 
procedure, he follows it consistently himself in all the problems pre- 
sented in his book. Whenever one variable is clearly the result, it is 
chosen as the dependent, and in such cases Dr. Ezekiel seems to feel 
that only this one regression equation has any meaning or any use. It is 
largely on these grounds that Dr. Ezekiel recommends in his technical 
appendix’ a computational method which gives only one of the n mul- 
tiple regression equations, when it would be possible with very little 
additional work to get them all.® 

What about this procedure? Does it give us the regression equation 
we want in each particular case? 

Suppose we take the example given by Dr. Ezekiel: tobacco yields 
and fertilizer applications. Let us imagine that we have surveyed 100 
farms in North Carolina, and have found the average yield of tobacco 
and the rate of fertilizer application on each farm. We want a line or 
equation which will express the relation between the two variables: 
tobacco yield and fertilizer application. There are two such lines (neg- 
lecting the possibility of choosing between different types of curves)— 
one with yield dependent and one with fertilizer dependent. Dr. Ezekiel 
says that variations in fertilizer are obviously to be considered as the 
cause of the variations in yield, and, therefore, that yield must be the 
dependent variable. Granting that the direction of the cause and effect 
relation is clear in this case, the writer fails to see any reason for con- 


4 Op. cit., p. 51. 

5 Op. cit., Appendix I, pp. 455-485. 

* Frederick V. Waugh, “A Simplified Method of Determining Multiple Regression Analysis,” this 
JournAL, December 1935, and “The Analysis of Regression in Subsets of Variables,” this JounNat, 
December 1936. 
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cluding that yield must be the dependent variable. We might use this 
set of data to answer either of the following questions: 

a. What were the average yields of tobacco associated with various 
amounts of fertilizer? The answer to this question is provided by the 
regression with yield as the dependent variable. 

b. What were the average amounts of fertilizer associated with vari- 
ous yields of tobacco? The answer to this question is provided by the re- 
gression with fertilizer application as the dependent variable. 

We shall not prove these statements here. The proofs can be found in 
practically any textbook on correlation and regression.’ The point is 
that both regressions have definite meanings and definite uses, and our 
choice does not depend at all upon which is cause and which is effect, 
but simply upon the question we want answered. We may very well 
want both regressions, because we may very wel! want to answer both 
questions. 

The same point may be illustrated with any problem which may come 
to the reader’s mind. However clear the direction of the cause and effect 
relation may be there are always two essentially different questions 
which may be asked in any problem of two variables. If we want the 
average, or expected, value of X, associated with any particular value of 
X:, the answer is given by the regression with X, dependent—regardless 
of the nature of cause and effect. If we want the average, or expected, 
value of X2 associated with any particular value of X;, the answer is 
given by the regression with X, dependent—regardless of the nature of 
cause and effect. 

If we want to estimate the average, or expected, height of fathers 
corresponding to a given height of sons, we should take height of fathers 
as the dependent, regardless of the fact that this must be the causal 
factor. If we want the average amount of potato production associated 
with parity prices of potatoes, we should take potato production as the 
dependent, even if we are sure that variations in production cause the 
variations in price. 

In a big multiple regression problem we often think in terms of one 
particular regression. Thus, in a farm management survey we may be 
concerned with a large number of (causal) factors which affect farm in- 
come (the resultant). In such a problem our thinking tends usually to 
run in the direction of investigating what changes in average, or ex- 
pected, income would tend to be associated with particular adjustments 


7 For example, in Ezekiel, Technical Note 2, on page 489. Note that Y is selected as the dependent 
variable without any consideration of whether it represents cause or effect. In any case the regression 
minimizes the sum of squares of the Y residuals. And in any case a similar mathematical development 
would show that by choosing X as the dependent variable we minimize the sum of squares of the X 
residuals. 
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in the several causal variables. We tend to frame our questions in such 
terms as, “How much increase in income could be expected on this farm 
if we added 50 chickens?” or “If these farmers fed each of their cows 
another pound of feed each day, how much would their average income 
be increased?” Both of these questions are answered best by the re- 
gression with income as the dependent variable. 

Does this mean that the other (n—1) regressions have no value? Not 
at all. Let us think of our problem in different terms. Let us suppose 
that some of the farmers in our survey must increase their incomes by 
$200 a year to get a minimum standard of living. They hope to do this 
by raising more chickens, but without making any other adjustments. 
We want to know then the average number of chickens associated with 
the higher income and with the same quantities of the other factors. The 
answer to this is given by the regression with the number of chickens as 
the dependent, and with income and all the other variables as independ- 
ents. 

Further examples should be unnecessary. The reader can easily apply 
similar reasoning to any problem of related variables, and will find that 
in each case there will be as many useful regression equations as vari- 
ables, and that each of these equations provides the answer to a reason- 
able question. 

Now, to summarize, and to add a few general comments. 

1. In any problem of relationships between n variables, each of the n 
regression equations has a definite meaning, and each can be used to 
answer a specific type of question. 

2. Even when our problem is such that one variable is clearly the “re- 
sult,” and the other variables are clearly “causes,” we must choose one 
of the “causal” variables as the dependent if we wish to estimate the 
average, or expected, value of that variable corresponding with any set 
of values for the others. 

3. We may very often wish to estimate the average, or expected, value 
of one of the “causal” variables needed to produce a stated “result.” In 
any such case the “causal” variable becomes the dependent. The “re- 
sultant” variable, and any other “causal” variables are treated as inde- 
pendents. 

4. There is no sound reason for the common rule of thumb practice of 
always designating current price as the dependent when relating it to 
such variables as current supplies and current incomes of consumers; 
nor for always designating consumption as the dependent when it is re- 
lated to current retail prices; nor for always designating acreage of a 
crop as the dependent variable when it is related to prices of that crop; 
nor for always designating the yield of a crop as the dependent when re- 
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lating it to fertilizer applications and other variables representing cul- 
tural practices. This rule of thumb practice is based on the mistaken 
idea that the only useful regression equation is the one with the “re- 
sultant” variable as the dependent. 

5. Nothing in this paper implies, in any way or in any degree, that we 
can neglect careful logical analysis in the statistical measurement of re- 
lationships. The writer subscribes completely to Ezekiel’s chapter® em- 
phasizing the need for such logical analysis. The point is only that con- 
siderations of cause and effect do not indicate which variable should be 
dependent. 

6. We are concerned in this paper only with the classical regression 
equations. These are to be used whenever our purpose is to estimate 
values of one of the observed variables. If, however, we assume perfect 
correlation between the true, underlying values of our variables, we are 
assuming that departures are due to errors of observation, or to what 
Frisch® calls “disturbances.” If we can estimate the relative magnitudes 
of these errors or disturbances in each of the variables it is possible to 
calculate one line in the case of two variables, or one n-dimensional 
plane in the case of n variables expressing the relationship between 
the true, or basic values. In this case there is only one line or plane, and 
no one variable is taken as dependent. The so-called “orthogonal re- 
gression” assumes equal errors, or disturbances, in each variable. No 
such line or plane gives any measure of correlation, since correlation is 
assumed to be perfect among the true, underlying values. Neither does 
such a line or plane provide an estimate of the average, or expected, 
values of any of the observed variables. 

8 Op. cit., Chapter 24, pp. 442-453. 


® Ragnar Frisch, “Statistical Confluence Analysis by Means of Complete Regression Systems,” 
Universitets M@konomiske Institutt, Oslo, Norway, 1934, especially Part II, pp. 47-83. 


COMMENTS 


By Morpecai EZEKIEL 
United States Depariment of Agriculture 


Dr. Waugh agrees that where we wish to estimate the most probable 
value of one variable from a given value of another, we take the latter 
as the independent variable, and the former as the dependent. He then 
goes on to say, however, that “both regressions have definite meanings 
and definite uses, and our choice does not depend at all upon which is 
cause and which is effect, but simply upon the question which we want 
answered.” My own belief is that in cases where we have definite non- 
statistical knowledge of which variable is cause and which is the effect, 
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the reverse regression, which measures the average or most probable 
value of the “cause” associated with given values of the “result,” is gen- 
erally of little significance and is likely to be used for illogical or incorrect 
conclusions. Where, by the use of multiple correlation, we can eliminate 
other associated causes and come still closer to a measurement of the in- 
fluence of each separate cause upon the result, the use of the reverse 
partial regression is, I believe, likely to be even more misleading. 

The misleading character of the reverse regression is shown by Dr. 
Waugh’s own illustration of farmers who wish to increase their incomes 
by $200 a year by raising more chickens, while holding the rest of their 
farming unchanged. Dr. Waugh would apparently determine this by 
finding the net regression of chicken numbers on farm income, while 
holding constant the other independent variables. Yet, for each individ- 
ual farmer involved, what he needs to know is not “What numbers of 
chickens, on the average, are associated with given incomes?” but rather 
“How many chickens need I add to my farm to increase my income 
$200?” What the farmer can do is to change the number of chickens ; that 
is the cause which he will employ to change his income. The most accu- 
rate way to find how much each chicken adds to income is to employ the 
net regression of income upon number of chickens. If this comes out 
50 cents, that will mean the 400 chickens will be the number probably 
required. If the question is restated as how many chickens for a given 
sum instead of how much money per chicken, that does not change the 
fact that the chickens are what the farmer can actually change. 

This differs from Dr. Waugh’s point of view. He states “we may wish 
to estimate the average, or expected, value of that variable (chickens) 
corresponding with any set of values for the others (income).” While 
“we” may at times wish to know how many hens the $500 farmer had, 
and how many the $700 farmer had, the farmer has only an academic 
interest in those facts. What he wants to know is, how much effect will 
a change in hens have upon his income, for the hens are the only thing 
he can vary directly. The income which will follow as a consequence is 
still unknown when the hen numbers are decided upon. 

I do agree that if we were to take new sample farms from the same 
universe, and to calculate for those new samples the average number of 
hens on farms having given incomes, the best advance estimate we 
could make of that relation would be based on the regression of hens on 
income in our original sample. But I question whether that information 
is of any value to the farmer who has to decide what he will do with his 
poultry flock—or for that matter, is often of very much value to anyone 
else. 

There has been much fallacious reasoning based upon the reverse 
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regression approach. One common illustration is the type of farm man- 
agement report which shows the average number of acres of each crop 
and head of each class of livestock on farms reporting various levels of 
income. It has been the evident conclusion that because the men who 
had high incomes had more dairy cows, it would therefore pay the men 
who had lower incomes to increase the number of their cows. Yet ac- 
tually a classification of the same data by dairy cownumbers might show 
only insignificant changes in average incomes with differences in num- 
bers of cows. Similarly, in the poultry case, because of some other vari- 
able not explicitly measured—such as the ability of the more intelligent 
farmers to handle all their livestock better—the data might show rel- 
atively small increases in chickens associated with very large differences 
in income, perhaps one more hen for each ten dollars more income. Dr. 
Waugh would then conclude that a small change in chicken numbers 
was associated with a corresponding large increase in income. This in- 
formation would tend to mislead the individual farmers, who after all 
were in position to change their chicken numbers but not to change 
their own inherent abilities. In fact, the lower the correlation between 
income and hens, and the smaller the differences in hens associated with 
given differences in incomes, the fewer the number of hens which this 
method would indicate as needed to increase farm income by $200! 

In view of these considerations, I still believe that the procedures rec- 
ommended in my book are the most satisfactory for the practical re- 
search worker who expects to use his research as a basis upon which to 
advise on the probable consequences of actions in the real world, for 
in that world, people need to know what effects (results) are most likely 
to follow from given things they may select to do (causes). 
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NOTE ON A COMMON STATISTICAL INEQUALITY 


By Irvine H. Sreceu 
U. S. Bureau of Labor Statistics 


HIS PAPER is devoted to the discussion of a family of inequalities 
frequently encountered in statistical work. The prototype relation 


SS st >> uw = Disvd ul (1) 


> st > sv 
z. ut ym uv | 


where all of the sums include the same number of items. In the termin- 
ology of Hardy-Littlewood-Pélya, the inequality is “homogeneous in 
>>” as well as in the variables, and the two sides of (1) are not necessar- 
ily “comparable.” That is, the sense of the inequality is not affected by 
the substitution of means for the corresponding sums, but it need not 
be fixed for all non-negative values of the variables.' 

In the following, we shall first present some useful identities related 
to (la) and then analyze some cases of the inequality which are of inter- 
est to the statistician. The validity of the identities is independent of 
the sign of the variables. In some of the inequalities mentioned later, 
however, one or more of the variables will be restricted to non-negative 
values. In any case, only real discrete variables are considered; and all 
of the sums are finite. Analogous inequalities generally exist for inte- 
grals and infinite sums. 

The factorability of the matrix of A into two rectangular ones is a 
very useful property. If each of the sums contains n elements, we have 


_ || Lest do sv | meee Gl 
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| 
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For our purposes, the rectangular matrices may be considered permuta- 
ble: the multiplication is performed row by row, and the value of A is 
not altered by the interchange of diagonal terms. The value of A also 
remains unaltered if the order of the rows is reversed in both rectangu- 
lar matrices, but its sign is changed if the order is reversed in only one. 
The matrix of A may also be decomposed into products of other rec- 
tangular matrices; for the sums in A may be expressed in terms of new 
variables (derived from the original ones) without being altered in value 
or general form. 


or 


A= (la) 
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1G. H. Hardy, J. E. Littlewood, and G. Pélya, Inequalities (1934), pp. 1-6. 
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By virtue of (2) or any of its variants, A may be transformed into a 
double sum containing the ,C2 products of second-order determinants 
obtainable from the rectangular matrices. For example, it follows from 
(2) that 
t; t; 


vi =U; 


ae ei* * (3) 


t=1 j=i+l 














Ui Uj; 


In many special cases of importance, each determinant product is 
positive (or negative) and A is necessarily greater (or less) than zero. 

The sum (3) may be regarded as the Cartesian form of a well known 
vector identity which is often associated with the name of Lagrange. 
If the s;, t;, ui, and v,(i=1, ---, nm) are the n rectangular coordinates 
of the vectors s, t, u, and v, respectively, then 


s:‘t s-v 
= (s X u)-(t X v), (4) 








u-t u-v 


where a dot signifies a scalar product and a cross signifies a vector 
product.” As will be illustrated later, it is sometimes enlightening to 
translate (4) into its trigonometric equivalent. 

There is often some advantage to be gained by the introduction of a 
weighted correlation coefficient which always has either the same sign 
as A or exactly the opposite.’ In either case, the correlation coefficient 
and A vanish identically. For example, we may write 


z{n~- Ese 


= 1M st: (u/s) -(v/t)Fst: (u/s)Fst: (w/t) ( - st)?, 


A 


where 1s¢:(u/s)-(v/t) 18 the coefficient of correlation between the u;/s; and 
v;/t; with weights sit; and ose:(u/s) ANd oeet:(v/e) are similarly weighted 
standard deviations. We may also express A in forms analogous to (5) 
containing the correlation coefficients Tu»: (s/u)-(t/v), —Tev:(u/s)-(t/v), and 


2 W. F. Osgood, in his Advanced Calculus (1935), p. 306, follows W. Blaschke, Vorlesungen iiber 
Differentialgeometrie (1921), I, pp. 6 and 13, in attributing (4) to Lagrange. But Lagrange’s contribution, 
included in a paper presented in 1773, is simply an algebraic identity corresponding to the case of (3) in 
which n =3, sj =t;, and uj =7;. Some writers (e.g., Courant and Goursat-Hedrick) ascribe only this case 
to Lagrange. See T. Muir, Theory of Determinants in the Historical Order of Development (2nd ed., 1906), 
I, pp. 33-37. 

3 For applications of weighted correlation coefficients, see the following articles by the writer in 
this Journa.: “The Difference between the Paasche and Laspeyres Index-Number Formulas,” Septem- 
ber 1941, pp. 343-350; “Further Notes on the Difference between Index-Number Formulas,” December 
1941, pp. 519-524; and “Index-Number Differences: Geometric Means,” June 1942, pp. 271-274. See 
also the papers of L. von Bortkiewicz mentioned in the first of these articles. 
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—Tut:(s/u)-(@/t). In each instance, A is numerically equal to a weighted 
product moment times the square of the sum of the weights. A useful 
relation is thus established between product moments and double 
sums. 

To emphasize the symmetry between the correlation and double- 
sum forms of A, we may explicitly introduce into (3) the variables and 
weights entering into the correlation coefficient in (5): 


= 2 U; Uu\/%j 
A= p>» Ps stot ~* —_ =) (= — =). (3a) 


i=l jmi+l 8 8i/ Xb; t; 
Similarly, we may rewrite (3) to correspond to any of the other weighted 
correlation coefficients appearing in alternative forms of (5). Inciden- 
tally, variants of (3) such as (3a) could, of course, be obtained directly 
from variants of (2) if the sums in | A] were first rewritten appropri- 
ately. 

We now turn to an examination of some of the inequalities repre- 
sented by A. First, we shall consider instances in which the sense of the 
inequality is not fixed. 

An important example is the index-number inequality, which has al- 
ready been discussed at some length in this JourNAL.‘ If the u; and s; 
represent, say, alternative sets of quantity weights and the »v; and ¢; 
represent prices in two periods, the latter being the base, then A gives 
the sign of the difference between two aggregative price indexes. The 
sign of the difference between the same two indexes converted to 
weighted arithmetic meansof the relatives v;/t;isgiven by A=) ST),.UV 
—-y-sv>> UT, where S;=sili, T.= 1, U;=st;(u;/s;), and V;=0,/ts; the 
two sets of weights are U; and S;. This result is widely applicable, for 
any harmonic mean of relatives is also equivalent to a weighted arith- 
metic mean. In the special case where the u;/s; (i.e., the ratios of the 
alternative sets of weights for either the aggregative or mean forms) 
represent quantity relatives corresponding to the price relatives being 
averaged, we have the difference between the Paasche and Laspeyres 
indexes. From (3a), it is evident that the Paasche index is necessarily 
greater than the Laspeyres index if the coefficient of rank correlation 
between the two sets of relatives is +1, and necessarily smaller if the 
value of the coefficieat is —1. Tchebycheff obtained an equivalent re- 
sult for analogous integrals.® 

The tetrad difference, ra.sT:.a—Ta-d’c-b, is a fundamental notion of 
psychometrics and also occurs in a simple case of vector correlation.® It 


4 See papers by the writer mentioned in footnote 3. 

5 Inequalities, pp. 43-44. 

* For the special case of vector correlation having the tetrad difference as numerator, see H. Hotel- 
ling, “Relations between Two Sets of Variates,” Biometrika, December 1936, p. 324, or C. C. Craig, 
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may be represented by A if the variables s;, t;, u;, and v; are replaced by 
a,’ =(a;— )a/n)/V/noa=(ai— >a/n)/[D0(a—Soa/n)?]"? and analo- 
gously defined b,’,c,’, and d,’. The tetrad vanishes when a weighted corre- 
lation coefficient such as Tq): (c/a). (a’ /b’) OF Tea’ (a’ fe’) -(b’/a’) EQUals zero; or, 
in geometric terms, when the planes a’, c’ and b’, d’ (the vectors have the 
coordinates a,’, c,’, b;’, and d,’, respectively) are at right angles. 

The sign of the partial correlation coefficient r.s.. is given by the 
sign of its numerator, 1o.s—Ta-cl'b-c=Tec-cl'a-b—Ta-cTb-c, Which is in the 
form of A for the variables a,’, b,;’, and c,;’=d,’ defined as in the preced- 
ing paragraph (whence )_c’?=1). Thus, the sign of the partial coefficient 
corresponds to that of Ta’b’: (e’ Ja’) +(e’ /b’) OT 1er2-(a’ fe’) + (b’ fe’) + 

The signs of the constants of the regression equation Y =f+gX may 
also be considered to depend on expressions like A which form the 
numerators of the determinant solutions to the normal equations.’ For 
example, if the observation equations are weighted by w;, then the 
following substitutions in A give the numerator of the regression co- 
efficient g: s;=w;, t;=1, u;=w;X;, and v;= Y;. If the observation equa- 
tions are unweighted, then s;=t;=1, u;=X;, and v;=Y;,. The cor- 
responding weighted and unweighted correlation coefficients, r.:x.y and 
rx.y, have the same signs as these A’s. 

We now turn to instances in which the two sides of (1) are “compara- 
ble”—in which the sense of the inequality is fixed for at least non-nega- 
tive values of the variables. In each case, we shall make the left mem- 
ber of (1) equal to or greater than the right; that is, A will be positive 
except in cases of equality. 

A familiar definitely directed inequality is the difference between the 
(weighted or unweighted) arithmetic and harmonic means of a set of 
positive quantities. Thus, if the z; are positive, }\z>z-!=n? is clearly 
valid, for the determinant products derivable from 


dz n |- SS eal 


n 2 a '---g,* 
are all non-negative. 

Another common relation which may readily be verified by the meth- 
ods described in this paper is the following moment inequality for 
(weighted) positive quantities: 


dpe _ De pet Do pan 
>t ep te dr 


“Recent Advances in Mathematical Statistics, II,” Annals of Mathematical Statistics, March 1942, p. 76. 
For vector treatment of tetrads, see E. B. Wilson, ‘‘On Hierarchical Correlation Systems,’’ Proceedings 
of the National Academy of Sciences, 1928, pp. 283-291. 

7 The determinant in the denominator also has the form of A but may be shown by (3) to be neces- 
sarily positive; in fact, this determinant is an example of Cauchy's inequality, which is discussed several 
paragraphs later. 
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Decomposition of the matrix 
| LP Do pe || 
Spe YD pen | 


shows that the typical determinant product is non-negative. This in- 
equality may be extended by repeated application to the factors on the 
right-hand side. 

The validity of the following moment inequalities, in which the 
weights p; are positive, may also be established readily by examination 
of the typical determinant products: 

~ pd pe = (D> pe)? 
E pet D pe & (Epes) 
Do pant? Do pew! = (2 pet)? 
The first and second of these inequalities hold for negative as well as 
positive values of z; and of the exponents; the third holds for mixed 
values of z; if m is an odd integer. 

The most famous special case of (1) is Cauchy’s inequality, which 
occurs not only in a variety of statistical problems but also in the 
derivation of more complex inequalities, like Hélder’s and Liapounoff’s. 
The matrix of A for Cauchy’s inequality may be written as the prod- 
uct of two identical rectangular matrices; the typical determinant 
product is a perfect square. All but one of the definitely directed in- 
equalities which have already been considered may be written in 
Cauchy’s form. For example, the expression giving the sign of the 
difference between an arithmetic and a harmonic mean is equivalent to 
the following: 


LeLet =n = Dey Dd [ey 2} - (L2e)"*f 2 0. 


Cauchy’s inequality is equivalent to the statement that the cosine 
of the angle between two vectors is never greater than unity. Since 
any correlation coefficient may always be written in cosine form, its 
value obviously cannot exceed +1, as is well known. By virtue of (3) or 
(3a), various correlation coefficients may be reduced to cosine form in 
more than one way; that is, the difference between the denominator 
and numerator may be given by more than one Cauchy inequality. For 
example, at least two such expressions exist for the weighted zero-order 


coefficient: 
fog we ee la = E wa/E wil - Y wb/E w) 
: [do w(a — > wa/ >> w)? © w (6 — >> wb/ >> w)?}/2 
ss Dei Doi Wit ;(a; — a;)(b; — b;) 
7 [doi do; wiv ;(a; — a;)? >> ww,(b; — b;)?}!/2 
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In the unweighted case, we have 
rey = ee (@ = Lea/ny(b ~ DL d/n) 
[Do (a — Do a/n)? >! (b — 2) b/n)?]”? 
Di Di (a; — a;)(b; — bj) ; 
[Dos Dos (as — as)? Dos Do; (bs — by)?" 


The first-order coefficient r.5.. may be expressed in an alternative 
cosine form involving the a,’, b,’, and c,’ defined in an earlier para- 
graph: 








Te-cTa-b — Ta-clb-c 





Tab-c = j i/2 
(Te-Pe-s cores a (95.0% e-e _ Tcl b-c) 2 


Di Di (ci’a;’ — ¢;'a;")(ci’d;’ — c;'bi’) 
[doi do; (ei’a;’ — ¢;'a;")? 0D; (ci'b;’ — ¢;'b;")?]*/? 


Partial coefficients of higher order may be written in analogous fashion 
in terms of multiple sums of products of higher-order determinant 
pairs. 

In conclusion, it may be well to recall at this point the analogy be- 
tween the first-order correlation coefficient and the fundamental cosine 
law of spherical trigonometry.* Since the latter may be represented by 
a Lagrange identity,® so may the former. Thus, if a’, b’, and c’ are 
vectors of unit length having the rectangular coordinates a,’, b;’, and 
c;’, respectively ({=1, ---,; >, a=) > b?=) c?=1), then 


/ 


(c’ X a’)-(c’ X b’) = c’-c'a’-b’ — a’-c'b’-c’. 





The partial coefficient corresponds to the cosine of the dihedral angle 
between the planes c’, a’ and c’, b’. The scalar products a’-b’, a’-c’, 
and b’-c’, which represent the cosines of the angles subtended by the 
sides of the spherical triangle, also correspond to the simple correlation 
coefficients rq.», Ta.c, and ry... The vector identity may readily be trans- 
lated into the double-sum coordinate form of the partial coefficient 
shown in the preceding paragraph. 

8 See D. Jackson, “Trigonometry of Correlation,” American Mathematical Monthly, June 1924, 
pp. 275-280, or R. W. Burgess, Introduction to Mathematics of Statistics (1927), pp. 275-276. For a com- 
prehensive treatment of correlation in general, see J. MacMahon, ‘““Hyperspherical Goniometry; and 


Its Application to Correlation Theory for N Variables,” Biometrika, December 1923, pp. 173-208. 
* See, for example, C. E. Weatherburn, Elementary Vector Analysis (1939), p. 72. 
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CONTENTS OF TIPPETT’S “RANDOM 
SAMPLING NUMBERS” 


By Ropert Gaag, M.S. 


Section on Biometry and Medical Statistics 
Mayo Clinic, Rochester, Minnesota 


s PART of a plan for setting up a series of punch cards to be used for 
A experiments with random samples, there were preliminarily 
punched onto Hollerith cards all the numbers in Tippett’s tables of 
random sampling numbers.! In view of the wide use of this table for ran- 
dom experiments of various sorts, it was thought that it would be of 
some interest to report briefly a portion of the results obtained from 
tabulating these cards. Previous analysis of Tippett’s data have been 
made by Yule? and by Dodd.* 

The table contains 41,600 digits printed on twenty-six pages, each 
page containing thirty-two columns and fifty rows; on each page the 
columns are assembled in groups of four. The fifty digits of each column 
were punched onto a card, one card for each column, thus providing 
832 cards that contained all the numbers in the table. In order to check 
the punching the entire procedure was repeated independently on an- 
other set of cards and checked mechanically. 

Table I gives the frequency of each digit and a few summary findings 
for runs. The runs were counted mechanically on a tabulator as for one 
continuous series of digits constituted as follows. The top row of the 
first page was made continuous with the top row of the second page, this 
with the top row of the third page, and so forth, until the top row of the 
last page, 26, was completed. The latter was then made continuous with 
the second row of the first page, and the process continued through the 
fiftieth row of the last page to complete the series. 


TESTS OF RANDOMNESS 


The order of the digits can be examined, of course, by a great variety 
of statistical tests, each testing the “null hypothesis” that the digits are 
arranged at “random.” The various tests that can be constructed will 
be differently sensitive to different kinds of systematizations,‘ and by 


1L. H. C. Tippett, Random Sampling Numbers, Cambridge University Press, 1927. 

2G. U. Yule, “A Test of Tippett’s Random Sampling Numbers,” Journal of the Royal Statistical 
Society, Vol. 101, pp. 167-172, 1938. 

3 E. L. Dodd, “Certain Tests for Randomness Applied to Data Grouped into Small Sets,” Econo- 
metrica, Vol. 10, pp. 249-257, July-October 1942. 

4 Joseph Berkson, “Tests of Significance Considered as Evidence,” this Jounnat, Vol. 37, pp. 325- 
335, September 1942. 

J. Neyman, Lectures and Conferences on Mathematical Statistics, Washingtun, The Graduate School 
of the United States Department of Agriculture, 1938, Chap. 3, pp. 33-48. 
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noting which gives unlikely values for the test P, one might be able to 
discern what sort of systematization was present. A number of tests 
that were tried are given here as examples. 

1. The expected frequency for any digit and its standard deviation 
are given as pn and +/ pqn respectively. Any observed frequency, for ex- 
ample, 4,264 for the digit zero, can be tested by the ratio (1) which will 
be normally distributed (for large n) in random samples with mean zero 
and unit standard deviation. 

4,264 —4,160 
/0.1 X 0.9 X 41,600 
In this manner each of the digits can be tested separately to see whether 
there is any indication of preference or avoidance of particular digits. 

2. An over-all test that will, however, not be so sensitive or revealing 
as to preference or avoidance of particular digits is the x? frequency 
test which is given by (2). 
> (f — 4,160)? 

4,160 

3. A priori it would seem that tests based on “runs,” that is, repeti- 
tions of the same digit, would be sensitive to likely departures from 
true randomization. It has been found by experience that it is practi- 
cally impossible to maintain a random arrangement of “runs” in the 
presence of any kind of bias. An article by Mood® makes a number of 
such tests easily accessible. 

For the number of runs of length j for digit 7 the mean and variance 
are given in (3) and the distribution for large numbers is sensibly 
normal. 


E(rii) = piiqi[(n — 7 — 1)gi + 2) 
Or5;? = piriqa?{ (n — 2j)(n — 2j — 1)qs? + (n — 2j)qi(1 + 5p) 
+ 6p? — [(n —j — 1)9: + 2]?} 
+ piiqi[(n — j — 1): + 2). 
For instance, to test the runs of two of digit zero (Table I) 
E(ro) = (0.1)2(0.9) [(41,600 — 2 — 1)(0.9) + 2] = 337.0, 





B23, P = 0.09. (1) 





3 = 


x,? = 11.2, Py = 0.26. (2) 


(3) 


Org? = 321.2 
362 — 337 
—_——_ = 14, P = 0.08. 
4/321.2 


’ A. M. Mood, “The Distribution Theory of Runs,” Annals of Mathematical Statistics, Vol. 11, pp. 
367-392, December 1940. 
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That is, the probability of getting 362 runs or more of two in random 
samples is 0.08. If there were an abnormal tendency to pairing of this 
digit, one would be likely to get a much smaller value for P using this 
test. Hence, there is little reason to suspect any abnormal pairing 
here. Similarly, each of the observed frequency of runs for the different 
digits can be tested. 


TABLE I 
FREQUENCY AND “RUNS” OF EACH DIGIT 

















: rj, runs of length j 
e Frequency Wy 
Digit 1 2 3 4 5 Total runs 

0 4,264 3,406 362 39 3 1 3,811 

1 4,081 3,248 356 35 4 0 3,643 

2 4,233 3,452 333 29 7 0 3,821 

3 4,224 3,408 344 37 3 1 3,793 

4 4,171 3,377 330 38 5 0 3.750 

5 4,144 3,356 329 42 1 0 3,728 

6 4,051 3,262 339 33 3 0 3,637 

7 4,084 3,311 325 33 6 0 3,675 

8 4,144 3,328 366 24 3 0 3,721 

9 4,204 3,409 332 37 5 0 3,783 
Mean 4,160 3,355.7 341.6 34.7 4.0 0.2 3.736.2 
Expected mean 4,160 3,369.6 337.0 33.7 3.4 0.3 3,744.0 























4. An over-all test for the runs of any particular digit can be made 
from the total runs observed for the digit. The expected value and vari- 
ance are 


E(r;) = npiqi + pi? 
= npi(1 — 4p; + 6p;? — 3p,*) + pi2(3 — 8pi + 5p,?). 
For example, to test the total runs of digit zero (Table I) 
E(ro) = 3,744, 9,2 = 2,733 
3,811 — 3,744 
/2,733 


Similarly, each of the other digits can be separately tested. 


5. An over-all test can be made for the total runs for all the digits. 
It has been shown that the variable® 


Q= p oNE x; (5) 


: (4) 


Or 





1.3, P = 0.10. 


is asymptotically distributed according to the x? law with k degrees of 


‘Formula 5 is a quadratic form and the j is not the length of run as in 3. 
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freedom. In (5) the 2; are defined as 


Ti — NPiqi 
ep a. ¢=1,2,---,k 
/n 
and ||c*|| is the inverse of 
oii = pigs — 3pi7Qi? 
oi; = — pip;(1 — 2p; — 2p; + 3pipj). 


The z; are readily determined from the data given in Table I, with 
n=41,600; p;=0.1; g;=0.9 and k=10. For this particular set of data 
||o*|| becomes 


o%* = (2 — 3p)/p(1 — p)(1 — 2p) 

oi = (1 —3p)/p(l—p)(l1— 2p), ij. 
Substituting the numeric equivalents in (5), Q@=15.3 which as a value 
of x? for ten degrees of freedom gives P=0.12. 


6. A quick over-all test can be made for the runs of all the digits in 
terms of the expectation and variance for such runs. The values are 


E(r) = n(1 — DU pi’) 
t= n[d pi +22) pi? — 3(L p*)’). 
Thus, the test for the 37,362 runs in this series is 
E(r) = 37,440, —o,? = 3,744 
37,362 — 37,440 7 
Jae 


(6) 
Oy 


- 13, P = 0.10. 





The fact that a variety of tests have not disclosed any indication of 
systematization in runs does not mean that none exists. It may be that 
the particular tests used were not sensitive to the particular type of 
systematization which existed. The kind of systematization which 
would be introduced is likely to be related to the manner in which the 
table was constructed. Since the digits are grouped in the table in 
groups of four, the possibility that there is systematization within these 
groups suggests itself. Table II gives the number of runs for each digit 
found in the 10,400 groups of four digits in Tippett’s table. The runs 
may be tested by the same theorems as utilized before or by simple 
extension of them. 
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TABLE II 
OBSERVATIONS OF “RUNS” IN 10,400 GROUPS OF FOUR DIGITS 








Mean Digit 








Expected | Observed! 0 1 


| 
3 ,582)3 ,431/3 ,666 3,608'3 ,564 3 ,540/3 ,474)3 ,477)|3 ,534/3 ,599 

















Runs of one 3,556.8 | 3,547.5 

Runs of two 271.4 274.7 303} 291] 256| 276] 266] 266) 259] 273] 282| 275 
Runs of three 18.7 19.7 24 20 17 20 25 24 17 19 14 17 
Runs of four 1.0 1.0 1 2 1 1 0 0} 2 1 1 1 




















7. As an example, to test the number of runs of two of digit zero 
(Table II) the expected value for a single group of four digits and the 
variance are given by (3) 


E(ro2) = 0.0261, Or” = 0.0254. 
For the sum of such runs in 10,400 groups the expected value and 
variance are 

E(r) = 0.0261 X 10,400 = 271.4 

o°(r) = 0.0254 X 10,400 = 264.2 

303 — 271.4 _ 

V/ 264.2 7 

The observed mean value for runs of two for all ten digits was 274.7 as 


compared with an expected value of 271.4. A test’ gives 


274.7 — 271.4 


4/78 
10 


Thus there is no evidence here either of systematization. Other tests 


have been applied with similar results. 
I wish to express my thanks to Dr. Joseph Berkson for suggesting 


this survey and for guidance in its conduct. 


1.9, P = 0.03. 





0.6, P = 0.27. 





? The correlation existing between the variables has been disregarded. 











QUALITY CONTROL APPLIED TO 
BUSINESS ADMINISTRATION* 


By Ws. B. Rice 
Piomb Tool Company, Los Angeles 


HE TECHNIQUES Of quality control, as established by Shewhart,' 

Simon,” and others, are being used extensively and wita great suc- 
cess in the manufacturing problems of industry, but it is not widely rec- 
ognized that these techniques can be applied to the problems of office 
and financial management (cf., however, footnote 7). 

It has well been said that quality control is one-tenth statistics and 
nine-tenths engineering judgment. If, in considering the non-manu- 
facturing problems of business administration, executive judgment 
is substituted for engineering, it is possible to set up a criterion by 
which the usefulness of the application of quality control methods can 
be gauged. That criterion should require the purely statistical tech- 
niques to reinforce the seasoned conclusions of experienced, expert 
business managers. This test is being met successfully in several experi- 
ments conducted by the author, at the same time uncovering significant 
facts which executives can use in the direction of efficiency and profit. 

Under the pressure of war-time conditions, many firms with contracts 
negotiated at fixed prices are finding that overtime wages paid to the 
office force become a noticeable factor in rising costs, a factor for which 
it is difficult to get compensation through the process of re-negotiation. 
The preliminary data used by the author in establishing economical 
limits for office overtime pay consisted in the percentage of pay for 
overtime work to pay for regular work, by weeks, for six departments, 
from January through June 1942. 

If we think of each department as a different machine operator, and 
take for each “production sample” the figures for the four (or five) 
weeks in each month, we get six points for each of the six months, a 
total of thirty-six samples in all. The preliminary data were set up in 
tabular form as shown in the table. Beginning with July, the data were 
improved by eliminating a non-homogeneous element—the salaries of 
supervisory personnel whose compensation was on a monthly basis and 
who did not get overtime pay. Calculations made from the new figures 

* A report presented at an Engineering, Science and Management War Training Course on Quality 
Control held at Los Angeles in the Fall of 1942 under the auspices of Stanford University, and sponsored 


by the U.S. Office of Education. The original presentation has been revised to carry it through December 
1942. 

Credit and thanks are due to Professor Holbrook Working, of Stanford University, and to Dr. W. 
Edwards Deming for their inspiring help in preparing this paper for publication. 

1 W. A. Shewhart, Economic Control of Quality of Manufactured Product, D. Van Nostrand, 1931. 

2 Leslie E. Simon, An Engineers’ Manual of Statistical Methods, John Wiley & Sons, 1941. 
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for July, August, and September, raised the overall average from 13.4 
to 20.3 per cent, reflecting a truer picture of the facts; and decreased 
the average range, as was to be expected, from 12.4 to 10.6 per cent. 


RELATION OF OVERTIME PAY TO PAY FOR REGULAR WORK, 
BY WEEKS AND DEPARTMENTS 














(Per cent) 
Weeks 
Ist 2nd 3rd 4th 5th Average Range 
(X) (R) 
January 1942 
Department 1 20.4 16.6 11.4 12.2 “= 15.2 9.0 
2 28.5 2.4 3.7 7.4 -- 10.5 26.1 
3 25.0 11.4 10.4 9.7 -- 14.1 15.3 
4 9.5 16.0 8.2 11.2 —~ 11.2 7.8 
5 22.3 7.0 24.5 36.0 _— 27.4 13.7 
6 7.5 10.4 7.9 12.4 -- 9.6 4.9 
February 1942 
Department 1 17.1 16.1 7.8 9.9 — 12.7 9.3 
2 5.1 5.6 4.2 7.8 _ 5.7 3.6 
3 11.6 10.2 7.6 16.3 -~ 11.4 8.7 
4 12.2 15.5 11.6 12.0 —_ 12.8 3.9 
5 21.2 18.3 34.3 48.3 —_ 30.5 30.0 
6 15.9 15.2 11.3 15.1 _— 14.4 4.6 
Ete. 





Using the new data for analyzing past experience, and the notation 
of the American Standards Association 1942 pamphlet? the central lines 
and action limits were computed as shown below.‘ 


X =20.3 per cent 
R=10.6 per cent for samples of 4 
X+A:R=28.0 per cent and 12.6 per cent 
D;R=0 
D,R=24.2 per cent 


For several months in the first half of the year department 5 was “out 
of control” on both average and range (see chart). In June, an alert 
office manager caught the excessive overtime of department 5, hired 
more help, and promptly thereafter brought it back within normal 
limits. Needless to say, if the control chart technique had been avail- 


* American Standards Association, Control Chart Method of Controlling Quality During Production, 
Z1.3-1942 (American Standards Association, 29 West 39 Street, New York), p. 39. 

‘ The weeks containing July 4th and Labor Day were omitted because they were abnormal. Con- 
sequently, all three months contained four weeks. 

5 The quantities As, Ds, and Ds are needed for computing the control limits on the charts. Their 
numerical values are given in tables; e.g. in Simon, loc. cit., pp. 204 ff., and in the appendix to the 
American Standards publication listed in footnote 3. 
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able, indications of trouble would have been given much earlier, with 
a worth-while saving in labor costs. 

More recently department 6 has gone out of control; but the difficulty 
of finding enough trained men for a specialized type of work under 
present labor market conditions is hindering efforts to remedy the 
situation. This, however, is not so much a statistical as a personnel 
problem. 


CONTROL CHART FOR OFFICE OVERTIME 
PER CENT OF OVERTIME PAY TO REGULAR PAY 


(Each point is the average for one month) 
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Management’s opinion as to the most economical amount of over- 
time allowable varies according to circumstances. Normally, the stand- 
ard might be set at no overtime, i.e., X=0. Under war conditions, 
when volume of production is paramount and workers are hard to get, 
a certain amount of overtime work is often necessary. But the decision 
is difficult as to what limits should be set. They should certainly not 
be determined without operating facts to go on, just as engineering 
specifications for a manufactured article should not be drawn up with- 
out a knowledge of what limits inhere in the process. 

In both factory and office quality control methods develop a basis 
upon which to base executive decisions. 

How well such an application to overtime wages meets a common- 
sense criterion is indicated by the upper limits for averages derived 
from the data for July, August, and September. Now it happens that 
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this limit, 28.0 per cent (X+A2R above), is close to 30 per cent, which 
represents 20 per cent overtime in hours worked, or one full extra day 
in a normal five-day work week. In this case the management con- 
sidered it reasonable to argue that if any department had enough work 
to keep its staff busy for an extra day, economy would be served by 
hiring sufficient additional help to do the work in five days, thus avoid- 
ing the necessity of paying overtime premiums regularly.* With these 
thoughts in mind, the control limits for future data were set up as 
follows: 


For averages: Upper limit, 30 per cent 
Lower limit, 0 per cent 

For ranges: Upper limit, 24.2 per cent 
Lower limit, 0 per cent 


These control limits were applied to the last quarter of the year, i.e., 
October to December 1942. The weeks containing holidays were omit- 
ted from the calculations because, with a full working force present on 
those days, overtime payments were necessarily abnormal. Effective- 
ness of quality control is indicated by the fact that with the exception 
of department 6 and one point for department 5, all overtime was kept 
within the agreed-upon limits. Seasonal factors accounted for depart- 
ment 5’s unavoidable lapse in December. 

In this connection the range chart is particularly important. Showing 
as it does the difference between the highest and lowest week in amount 
of overtime paid for during the month, the range gives a measure of 
the skill and care with which work-flow through each department is 
planned and supervised. Extreme irregularity in work-flow shows up 
as an out-of-control point on the range chart, and can often be used to 
disprove a department head’s alibi of “too much work” when he is 
confronted with the fact of excessive overtime paid to his employees. 

The author’s observation is that, in applying quality control to 
general management, the behavior of the range or standard deviation 
chart will reveal more valuable information than the chart for averages. 

Office overtime analysis is only one example of the application of 
quality control to the interests of general management. The field is 
fertile but largely unexplored. For instance, quality control can be 
used to set norms and action limits for clerical work, both by individuals 
and by groups where, as in stenography or billing, quantitative stand- 
ards can be established. An application toward the improvement of 
clerical work, and toward reduction in costs, has been made and re- 


* With the 48-hour, six-day week established officially in many sections of the country, these con- 
clusions may need to be reconsidered. 
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ported by Dr. Deming and his staff at the Bureau of the Census.’ 
Evaluation of sales and territories, by salesmen and by lines, can be 
undertaken. Budgetary and cost accounting records are illuminated. 
Along some of these lines work now being done by the author indicates 
that, as in manufacturing, such studies do determine economic action 
limits, thereby lessening the heavy burdens of executive responsibility. 

To the many friends of quality control who are personally convinced 
of its value but who have to sell their superiors on the idea, its applica- 
tion to specific problems in business administration offers an inexpen- 
sive and powerful psychological approach. It requires no costly experi- 
mental interruptions of production; treads on few toes; invites fewer 
organizational upheavals; and by proving its worth in one field, opens 
the way logically to many others. 


1 W. Edwards Deming and Leon Geoffrey, “On Sample Inspection in the Processing of Census Re- 
turns,” this Journau, Vol. 36, 1941, pp. 351-360. 
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FAMILY VERSUS INDIVIDUAL IN MEASUREMENT 
OF AUDIENCES 


By ALFRED Po.itz 


DVERTISING RESEARCH owes its place entirely to the requirements 
A of advertising itself. Thus the value of a particular research tool 
depends entirely and cnly on its usefulness to advertisers. 

Since the best of advertising has no selling power until it is presented, 
one of the most fundamental problems to advertisers, publishers and 
radio men is to know how many people read a magazine or listen to a 
radio program. And it is in solving this basic problem that a number of 
research endeavors have adopted a method which defeats their purpose. 

It came about this way. Since most people live together in groups 
as families, there exists the idea of the family as a social unit. But this 
idea goes a step further as far as advertising is concerned. Many or most 
articles which the members of the family consume are not bought by 
all members separately, but only by one representative, for example, 
the housewife. Thus the family assumes the appearance not only of a 
social unit, but also of a single consuming unit. 

In view of this, it seemed to research men and publishers, and justly 
so, that it was inadequate to express the readership of an advertising 
medium in terms of total individuals. It seemed that there should be a 
unit of measurement which would better parallel this structure of 
consumption. So the “family base” was chosen. No specific studies 
using the family as the base need to be mentioned, since it is generally 
known that many, if not most, readership reports employ this method. 

Assume that there is a statistical universe of 24 families. For the 
sake of simplicity, each family consists of four members: Husband, 
housewife, son, and daughter. 

In this universe there are two newspapers, A and B. Expressed in 
terms of individuals, each newspaper has 48 readers. So in terms of in- 
dividuals they are equal. 

But the distribution of the individuals differs in the following way: 


A is read by 2 persons in each of 
24 different families (housewife and daughter), 


B is read by 4 persons in each of 
12 different families. 


Which paper is a better medium for tomato juice? 
According to certain research conclusions, A is better, because it 
goes into twice as many families as B. 
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The idea is that tomato juice is purchased by the housewife. If the 
ad is read by her, the purpose of the ad is practically fulfilled. Any ad- 
ditional reading by husband, daughter, or son is of only secondary value. 
These persons may speak to the housewife about tomato juice and in- 
crease her willingness to buy a certain brand, but it is obvious that such 
influence upon the final purchase is smaller than that of the housewife 
herself. 

A and B are equal, inasmuch as each is read by 48 persons. 

Newspaper A, however, which goes into 24 families, influences 24 
housewives. While newspaper B, which goes into 12 families, influences 
only 12 housewives. And so, on the “family base,” A is better than B. 

Thus there are many research men and advertisers who believe that 
the family, instead of the individual, is the right unit for describing the 
readership of newspapers and magazines and the audience of radio 
programs. But the whole family approach is based on a fallacy. 

It is true that, in this particular example, the number of families 
comes closer to the description of the relative values of the newspapers 
than does the total number of individuals. But that is more or less 
accidental and does not warrant the use of “the family” as a yardstick. 
For what really was used as the gauge was the number of housewives. 

The family is only a condition of social life which gives the housewife 
(an individual) a greater importance for buying tomato juice than the 
son (another individual). Consequently an advertiser of tomato juice 
is more interested in the housewife than in the son. 

There are other conditions, also, which determine the value of certain 
individuals. 

Some people’s habit of living in apartment houses makes the super- 
intendents, who also live there, more important to coal dealers than 
the ordinary tenants. Southerners live in a warmer climate than the 
Northerners, and so are more important to manufacturers of light 
clothing. 

There is a direct and simple device in every statistical picture which 
takes care of situations in which one type of individual is distinguished 
from other types. That is the breakdown of the total figure into specific 
ones. 

Suppose, for instance, that the total figures of readers are known for 
different magazines, and that Southerners are more important for the 
given purpose than Easterners, Westerners, and Northerners. Then, 
from the total figures, the number of Southerners within the readership 
of each magazine simply must be sorted out. 

There is a universe of people living in apartment houses. Two news- 
papers are distributed among them, and both newspapers are equal in 
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the total number of individual readers. But the particular advertising 
problem is concerned with coal. Therefore the total number of readers 
is not sufficient. The figure must be broken down into “superintend- 
ents” and “tenants.” If it appears that one paper is read by more 
superintendents than the other, it therefore has a better chance to sell 
coal. 

If lipstick is the product, the readership figures are broken down into 
females and males. For Cadillac cars, they are broken down into well- 
to-do people and not well-to-do. 

For tomato juice, the figures must be broken down into housewives 
and others. 

If paper A with 48 readers reaches 24 housewives, and B with its 48 
readers reaches only 12 housewives, then A is better than B. It is better 
because it reaches more housewives and not because it reaches more 
families. 

It is possible that the greater number of housewives may coincide 
with a greater number of families. And in the first “tomato juice” 
example, it actually did. 

Any different example, however, is equally justified. 

Suppose, as in the first example, that paper A has 48 readers in 24 


_ different families. And it is always read by two family members—in this 


case, however, son and daughter, instead of housewife and daughter. 

Paper B is read entirely as before. It has 48 readers in 12 families, 
and it is always read by all four family members, husband, housewife, 
son, and daughter. 

If, for the sake of tomato juice, the number of families indicates the 
value of a newspaper, then A is still better than B, for A still reaches 
twice as many families. 

But looking at the kind of individuals it reaches—there is not a single 
housewife involved. Although there are twice as many families, it is 
read only by sons and daughters. So it cannot have twice as great a 
value; quite on the contrary, it may have no value at all. 

Therefore as was said before, if newspaper A has more value for 
tomato juice, it cannot be on account of the greater number of families, 
but only on account of the greater number of housewives it reaches. And 
it is a mere accident if both figures happen to be in agreement. (As a 
matter of fact, they usually are in disagreement, for the very reason 
that a family is a collective of heterogeneous elements in which all ages 
and sexes are mixed.) 

The family, itself, has neither a sex nor an age nor an occupation in 
which an advertiser might be interested. It cannot read and it cannot 
buy, the two activities which are most important to an advertiser. And 
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whenever an attempt is made to use such a unit as a yardstick in meas- 
uring activities which it does not perform, but which are performed 
rather by one or two or three or more of its members, the purpose of 
the yardstick is defeated. 

If surveys using the family base speak of 1000 families which are 
reached by a newspaper or magazine, they do not define what they 
mean. Do two members of each family read the newspaper? Or does 
one member of each family read the paper? Or is a family counted if 
at least one member reads the paper? In practice the “at least one” 
definition is used. 

“At least one” means: One, or two, or three, or more. There is the 
first variable; it is not known how many people read the newspaper. 

But there is still another variable. If a paper reaches 1000 families— 
and that means it reaches either 1000 or 3000 or more people—who 
are these people? They are not housewives, for if they were, they cer- 
tainly would have been called so. Nor, for the same reason, are they 
husbands. They are, in fact, not described at all. 

In a short sentence it looks like this: A newspaper which “reaches 
1000 families” is a newspaper which is read by 1000 or 3000 or more 
people, who are either housewives or husbands or someone else. 

The variables in this definition render the “family” useless as a yard- 
stick. There is a possibility, however, of making the “family” useful 
as a yardstick by attaching to it a sufficiently rigid definition. That 
would be: A family is reached by a newspaper or radio program if each 
of its members (or adults) is reached. 

On the basis of this reasonable definition, however, what must the 
research man do with a copy which is not read by each member of a 
family, but by only one or two or three members of a family which 
consists of four? In accordance with the last definition, such a copy 
does not reach the family. 

The number of families reached would be correct and clear in itself, 
but very small. Only a minority of copies reaches every member of a 
family. So by this definition the majority of copies is neglected. In 
other words, the figure so obtained does not say anything about the 
fate of the copies, but only about the fate of some of the copies. It does 
not tell about readers of the paper; it tells only about certain readers, 
namely, those who have close relatives reading the same paper. The 
definition is technically clear but irrelevant to the purpose. This is so 
obvious that it automatically explains why this definition never was 
used. 

It must be remembered that there is a priori so much clarity and 
simplicity in expressing the value of a newspaper or magazine in terms 
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of individuals, that the inventors of the family base needed an enticing 
reason for becoming involved in it. The enticing reason was the fact 
that equal numbers of individual readers do not necessarily express 
equal values of newspapers. Paper A reaches 48 readers in 24 families 
(husband and housewife are readers). Paper B reaches 48 readers in 12 
families (husband, wife, son, and daughter are readers). For selling 
tomato juice, newspaper A is better than B. “While having the same 
total of individual readers, it reaches twice as many families.” Now this 
statement is only true if a family can be constituted by two of its mem- 
bers as well as by four. 

Thus it develops that the “family” could offer its delusive service as 
a yardstick only because of its hidden flexibility. 

It is understood that no one tried to aim at flexibility. The aim was 
to use a unit which would parallel in audience measurement the known 
facts of family consumption. Unfortunately “the family” was chosen 
as the unit, without sufficient analysis. It provided the chance to use 
comfortable terminology and gave the feeling of scientific rigor. Yet 
by its very nature it led to erroneous conclusions. 

On the other hand it requires only a very small sacrifice of prelimi- 
nary comfort to obtain and enjoy an accurate method of measuring 
reading and listening audiences. It is the individual who listens and 
reads; it is a certain type of individual whose reading and listening are 
more or less important for a particular purpose. And it is the statistical 
sorting out of these types which tells the complete story about the value 
of a radio program or a periodical. 











THE COST OF LIVING IN CITIES IN RELATION 
TO THEIR SIZE AND LATITUDE 


By E. L. THORNDIKE AND ELLA Woopyarp 


T IS REPEATEDLY ASSERTED or assumed that the cost of living is much 

greater in large cities than in small, and in northern cities than in 
southern. Both statements are false. Except for rent the differences in 
retail prices between cities of 30,000 to 44,000 inhabitants and cities of 
300,000 to 500,000 inhabitants are near zero and the difference in rent 
for a house or apartment of equal quality is less than twenty per cent. 
The differences in relation to latitude are almost nil. 

We examined the advertisements appearing September 13 to 18, 
1937, in the daily newspapers of the cities (265 in number) of over 
30,000 population and under 500,000 (in 1930) which had daily news- 
papers.'! We recorded carefully the prices found in all advertisements 
for any of the 28 items listed below, except that care was taken not to 
include more than once an apartment or house advertised on more than 
one day or by more than one agent, and that if more than twenty of any 
one size were advertised, twenty were chosen at random. 


. Monthly rent, apartment of 3 rooms, unfurnished 

. Monthly rent, apartment of 4 rooms, unfurnished 

. Monthly rent, apartment of 5 rooms, unfurnished 

. Monthly rent, apartment of 6 rooms, unfurnished 

Monthly rent, house of 4 rooms, unfurnished 

Monthly rent, house of 5 rooms, unfurnished 

. Monthly rent, house of 6 rooms, unfurnished 

. Monthly rent, house of 7 rooms, unfurnished 

. Monthly rent, house of 8 rooms, unfurnished 

10. Sale price of house of 4 rooms, unfurnished 

11. Sale price of house of 5 rooms, unfurnished 

12. Sale price of house of 6 rooms, unfurnished 

13. Sale price of house of 7 rooms, unfurnished 

14. Sale price of house of 8 rooms, unfurnished 

15. Sale price of house of 9 rooms, unfurnished 

16. Permanent wave, lowest of the advertised prices 

17. Permanent wave, median of the advertised prices 

18. Permanent wave, highest of the advertised prices 

19. Movies, average of lowest price of evening admission, and of other than 
evening admission if such is advertised 

20. Movies, average of median price of evening admission, and of other 
than evening admission if such is advertised 

21. Milk, using all firms advertising* 


CONOR wD = 


1 Except that three resort cities (Atlantic City, Miami, and St. Petersburg) were excluded and 
Milwaukee was used instead of Washington. 
2 Milk prices appeared in too few cities (only 18) to be used. 
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22. Butter, using all firms advertising 

23. Eggs, using all firms advertising 

24. Chicken, using all firms advertising 

25. Fowl, using all firms advertising 

26. String beans, using all firms advertising 
27. Potatoes, using all firms advertising 

28. Cabbage, using all firms advertising 


Expressing all prices as percentages of the median price for the thing 
in question in cities of 55,000 to 65,000, we have the facts of Table I. 


TABLE I 


THE RELATION OF CERTAIN ADVERTISED PRICES 
TO SIZE OF CITY IN 265 CITIES 




















Prices expressed as percentages of the median price for 
cities of 55,000 to 64,999 inhabitants 
Size of city ; | Renae, | Average 

Chicken| Fowl | Vese Eggs | Butter! Movies| nent | 

tables ih 

| waves 
300 ,000 to 499,999 854 94 100 103 104 100 106 98.9 
200 ,000 to 299 ,999 94 100 1104 89 103 | 102 90 98.4 
115,000 to 199,999 104 100 100 96 103 1014 99 100.5 
95,000 to 114,999 96 104 100 98 100 914 100 98.5 
75,000 to 94,999 964 93 1054 104 100 100 100 | 99.9 
55,000 to 74,999 100 100 107 98 100 100 994 100.6 
45,000 to 54,999 100 100 107 96 101 103 107 102.0 
35,000 to 44,999 100 100 | 107 104 | 100 | 85 102 | 99.7 
30,000 to 34,999 94 96 100 96 100 100 97 97.6 

















We did not make similar records and computations for the retail 
prices of staples like flour, sugar, gasoline, trade-marked goods, drugs, 
books, cigars or cigarettes, because these should be less sensitive to size 
of city than the commodities of Table I. But a rough inspection of prices 
for some of them shows no evidence that they are cheaper in the smaller 
cities. 

Expressing the median rental of an apartment or house of specified 
size in each group of cities as a percentage of the median price in cities 
of 55,000 to 65,000, we have nine sets of percentages for apartments of 
3, 4, 5, and 6 rooms and houses of 4, 5, 6, 7, and 8 rooms. We also have 
six similar sets of percentages for sales prices of houses of 4, 5, 6, 7, 8, 
and 9 rooms. The averages, when all fifteen sets are given equal weight, 
are as shown in Table II. There is a clear rise of about 10 per cent from 
cities under 55,000 to cities of 55,000 to 114,999, and a less clear rise of 
about 5 per cent from cities of 55,000 to 114,999 to cities of 115,000 or 
over. Any reasonable system of weights gives closely similar results. In 








240 AMERICAN STATISTICAL ASSOCIATION: 


so far as the apartment or house of the same number of rooms tends to 
be of better quality in the larger cities, the figures of Table II exaggerate 
the differences in rental or purchase for equally good accommodations. 


TABLE II 


THE RELATION OF ADVERTISED PRICES OF APARTMENTS 
AND HOUSES FOR RENT AND FOR SALE 
TO SIZE OF CITY IN 265 CITIES 











Average of fifteen sets of percentages for rents of apartments 
Size of city of 3, 4, 5, and 6 rooms, and houses of 4, 5, 6, 7, and 8 
rooms, and of houses for sale of 4, 5, 6, 7, 8, and 9 rooms 

300 ,000 to 499 ,999 1024 
200 ,000 to 299,999 98 
115,000 to 199,999 98 

95,000 to 114,999 91 

75,000 to 94,999 05 

55,000 to 74,999 984 

45,000 to 54,999 86 

35,000 to 44,999 86 

30,000 to 34,999 844 








The higher cost of lodging in the larger cities is not a consequence of 
higher prices for building materials. The data reported by the U. S. Bu- 
reau of Standards on the prices of brick, sand, stone, cement, lime, 
boards, 24’s, gypsum board, nails, tar-paper, glass, linseed oil and 
white lead, are in fact higher for cities under 100,000 population than 
for those from 100,000 to 200,000, and higher for the latter than for 
cities of over 200,000. Using percentages of a base-price for each com- 
modity, the averages of the thirteen percentages are as shown in 


Table ITI. 
TABLE III 


PRICES OF BUILDING MATERIALS IN 
RELATION TO SIZE OF CITY 











Sise of city Number of cities studied by Average of 13 percentages 
the Bureau of Standards for brick, sand, ete. 
Over 1,000,000 4 92.9 
500 ,000 to 999,999 6 91.7 
300 ,000 to 499,999 8 92.3 
200 ,000 to 299 ,999 9 96.5 
100 ,000 to 199,999 16 100.2 
30,000 to 99,999 11 102.6 
Under 30 ,000 3 114.9 











A critical study of the costs of lodging, foods, carfares, electric current 
and other commodities and services in various cities reported by the U.S. 
Bureau of Labor Statistics and the National Industrial Conference 
Board reveals that the discrepancies between their findings concerning 
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size of city and ours are not large and may be explainable by differences 
in quality. 

The advertised prices in the 265 cities show no clear differences in 
relation to latitude, either for foods, movies, etc., or for rent. The medians 
of eleven percentages summarizing both for the groups at 44° or more, 
43°, 42°, 41°, 40°, 39°, 38°, 36° and 37°, 33° to 35°, and south of 33° are, 
in order, 96, 91, 100, 100, 102, 100, 98, 102, 93, and 88. 

In the prices for domestic use of electricity reported for 51 cities by 
the Bureau of Labor Statistics there is little or no correspondence with 
latitude. The percentages for 12 cities north of 42°, 13 cities at 40° or 
41°, 13 at 37° to 39°, and 13 south of 37°, are in order 102, 104, 90, and 
105. The prices of building materials are a little lower in the most north- 
erly cities. 











EXPERIENCE WITH TESTS OF SIGNIFICANCE: 
A REPLY TO PROFESSOR R. A. FISHER 


By JosepH Berkson, M.D. 
Section of Biometry and Medical Statistics, Mayo Clinic* 


ROFESSOR R. A. FISHER in his note on my article “Tests of Signifi- 
oo Considered as Evidence”! finds it pertinent to say, “It is not 
my purpose to make Dr. Berkson seem ridiculous, nor, of course, to 
prevent him from providing innocent amusement,” and to make some 
biographical comments of a similar intention. This is no time to begin 
calling names across the sea, and however strongly I may differ with 
Professor Fisher in regard to scientific questions, I shall confine my 
differences to the subject matter discussed. 

My present attitude toward statistical tests of significance is the re- 
sult of more than fifteen years of experience in their application to a 
large variety of data. I have no desire to undermine them unwarrant- 
ably. On the contrary, since my profession is that of consulting statisti- 
cian, I should like to possess a tool in the use of which I was a specialist 
and which could serve well those who engaged my services. But if I 
have found these tests misleading in certain applications, it is appropri- 
ate that I expose this fact in statistical journals for the view of my col- 
leagues. I may be wrong, but I am not such a fool that I cannot repeat 
as well as anyone who has ever memorized a textbook, the argument 
about the null hypothesis and the small P disproving it. At the begin- 
ning of my career I quoted the orthodox view. But a consulting stat- 
istician in a practical institution finds himself under the necessity of 
reconciling what he says to those who employ him with what they find. 
Time and time again I came upon actual situations in which (1) it was 
patent to all competent people who viewed the data, that the null 
hypothesis, on the evidence, was not true, but a small P did not result, 
(2) a small P did result, but it was obvious to all who examined the 
data, that on the evidence the null hypothesis was substantially true, 
or (3) when the null hypothesis was tested by several tests, each put 
forward with equally good authority, the P’s were considerably differ- 
ent. 

Reflection on this situation led me to the conclusion that the argu- 
ment by which tests of significance are usually supported is basically 
unsound. The fallacy lies fundamentally in the argument’s not con- 


* At present on leave, Lt. Col., Medical Corps, Army of the United States, Chief, Statistical 
Division, Office of the Air Surgeon. 

1 Joseph Berkson, “Tests of Significance Considered as Evidence,” this Journat 37: 325-335 
(September) 1942. R. A. Fisher, “Note on Dr. Berkson’s Criticism of Tests of Significance,” this 
JOURNAL, 38: 103-104 (March) 1943. 
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sidering the alternative to the null hypothesis. I have found that the 
quite numerous instances of false or paradoxical conclusions of my 
experience are resolved, if the view is adopted that what the result ob- 
tained from a test of significance does is not to “disprove the null 
hypothesis tested,” nor even to give a basis for “rejecting” it but rather 
that the finding affords evidence in favor of something—in favor of some 
alternative. In ordinary rational procedures and in scientific inquiry 
too, one posits explicitly or implicitly a set of alternatives, A and not 
A: for instance, the patient has syphilis or he does not have syphilis; 
there is no difference in means or there is some difference; the dis- 
tribution is mesokurtic or the distribution is heterokurtic; the regres- 
sion is straight or the regression is curved. When the P of an appropriate 
t test is small, according to my view, one does not “disprove” that 
there is no difference in means; one has affirmative evidence in favor 
of the alternative, that is, that there is such a difference. I say this con- 
stitutes such evidence, because in the circumstances, a small P would 
occur frequently (relative to the level of significance) if there did exist 
a difference between the means. 

The small P’s of tests are evidence for any alternative in which they 
occur frequently. Good tests are ones in which (1) when a small P oc- 
curs it is evidence for an alternative in one particular pair of contraries 
and (2) when this alternative is true, a small P will occur very fre- 
quently. Such a test might be called highly specific and sensitive. An 
example is the ¢ test appropriately performed with ample numbers for 
a difference between the means of two normal distributions. When with 
such a test a small P results, it points to nothing except that there is a 
difference in means; if there is a substantial difference in means, such 
a P is likely to result.2 Tests which Neyman and Egon Pearson call 
“universally most powerful” tend to be specific in this sense. When 
with such specific tests one has sufficient numbers, they become sensi- 
tive tests; in the terminology of Neyman and Pearson they become 
“powerful.” I have suggested the generalization of which I feel only 
fairly sure, that highly specific tests are always equivalently statable 
in terms of an estimate and its confidence limits, and that it is better 
to state them in that way. There are being widely used in statistics 
today tests that are not good tests, in the sense mentioned, and these 
are the ones that I have been questioning. Examples are (1) the fre- 
quency chi-square test and (2) many applications of analysis of vari- 
ance. 

If the view is adhered to that what the small P of a test of signifi- 


2 J. Neyman and B. Tokarska, “Errors of the Second Kind in Testing ‘Students,’ Hypothesis, 
this JourNaL 31: 318-326 (June) 1936. 
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cance does is to “disprove the null hypothesis tested” then not only 
will these latter tests be pocr tests in being nonspecific or insensitive 
or both; they may be positively misleading. The case I cited of Profes- 
sor Fisher’s application of the technic of analysis of variance as a test 
for linearity is a case in point. This example was not brought out in 
isolation to point up anyone’s individual blunder, but was part of a 
continued discussion of the pitfalls into which the pursuit of an errone- 
ous conception of tests of significance could lead one. It was only one 
of many examples that I have published, mostly from my own ex- 
perience. Professor Fisher applied* what he titled a “test of straightness 
of regression line” to some data of Hersh,‘ of eye facet number of 
Drosophila in relation to temperature, obtained a small P and con- 
cluded that the regression is nonlinear. He insists in his recent. note that 
this is so, because this, he says, is the objective way of determining that 
it is not linear. Now asa naive desk neighbor of mine said when I read 
him the results, “If it isn’t linear, what is it?” A plot of the points and 
a least square fit did not indicate any curvature that I or anyone else 
to whom I exhibited the graph was impressed with. If I were to pre- 
dict facet number from a regression on temperature for these data, 
I should use a linear regression. On these grounds I wrote: “On in- 
spection it appears as straight a line as one can expect to find in bio- 
logical material.” Professor Fisher thinks this judgment of the lack of 
curvature, instead of the reverse judgment based on the small P of his 
test, is obtuse and authoritarian. But I presented the evidence to the 
reader and I gave another explanation of the small P. 

I do not adhere to the doctrine that a small P disproves the “null 
hypothesis tested,” in this case that the regression is linear. I have at- 
tempted to demonstrate in the article mentioned that, while this prin- 
ciple appears plausible, it is actually illogical and at variance with the 
usual mode of scientific demonstration. I do not think that a logician 
would support the proposition that “If an event known to occur rarely 
in hypothesis H, occurs, hypothesis H is disproved,” and no validating 
experiments have been brought forward to support it either. In practi- 
cal fields of investigation in which a test, 7, is utilized, as for instance 
when a serologic reaction is used for the diagnosis of syphilis, validation 
of the test involves the determination of the frequency of T in (a) nor- 
mal persons (null hypothesis) and (b) syphilitic patients (alternative). 
No one in this field would support such a test by demonstrating merely 


*R. A. Fisher, Statistical Methods for Research Workers. Ed. 7, Edinburgh, Oliver and Boyd, 1938, 
356 pp. 

4 A. H. Hersh, “The Effect of Temperature upon the Heterosygotes in the Bar Series of Drosophila,” 
Journal of Experimental Zoology, 39: 55-71 (February) 1924. 
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that if the population is normal, 7 occurs only rarely. Such an argu- 
ment would appear absurd. What is wanted also, and indeed what is 
wanted mainly, is information as to how often T occurs if the popula- 
tion is syphilitic. It is this that determines the usefulness of the test for 
its purpose, the diagnosis of syphilis. 

Why should the validity of statistical tests be immune to such critical 
appraisal as is required as a matter of course in any other field of ap- 

lication of tests? Analogous validation for statistical tests of signifi- 
p g 
cance would involve determining, first in what alternatives, and then 
how often the critical value of the test function occurs. Before the work 
of Neyman and Egon Pearson, no such inquiry was made for any 
statistical test of significance. Even now it has not yet been made for 
the tests that I have been questioning, despite raucous and repeated 
challenges from me to the mathematicians; for instance, it has not been 
made for all phases of the test of linearity of Fisher which we are dis- 
cussing. Examples of questions which would be answered by such an 
inquiry are: What is the probability of obtaining a value of P<0.05 
with this test if (a) the abscissal variate z is not subject to variation, 
the ordinate variate y normally distributed with uniform variability 
o,, and the means of 7, follow some given nonlinear regression, (b) the 
regression is linear, the abscissal variate not subject to variation but 
the variability of y, c, is a given function, not constant, (c) the regres- 
sion of means is linear, ¢, constant, but z not constant but subject to 
normal variation with o,? 

I say the small P is evidence for any alternative hypothesis in which 
it will occur frequently (relative to the level of significance). In the 
instance in hand I suggested three® specifically and others generally: 
(1) curvilinearity as against linearity, (2) heteroscedasticity as against 

’ P.C. Tang, “The Power Function of the Analysis of Variance Tests,"’ Statistical Research Memoirs. 
Edited by J. Neyman and E. 8. Pearson, Department of Statistics, University College, London, De- 
eember, 1938. 

* At several points Professor Fisher achieves miscomprehension only with great effort. He says: 
“What Dr. Berkson fails to realize is that the judgment, from inspection, that the line appears as straight 
as one ought to expect, would have given no aid whatever towards discovering the cause of any real 
anomaly, whatever the cause might be, because that judgment in effect denies the evidence that any 
real anomaly exists.” The fact is that it was I, not Professor Fisher, who suggested the multiplicity of 
possible meanings of the small P in this application. I said: “ . . . a small P is evidence in favor of curvi- 
linearity ..., also . . . in favor of linearity plus heteroscedasticity . . . the temperature . . . subject to 
fluctuation ... and... other conditions which . . . produce a small P relatively frequently . . . Which 
of these shall be taken to have been demonstrated . . . will have to be determined by other evidence, 
possibly other statistical tests.” It is hard to imagine how I could have been plainer on this point. The 
possibility of a “remarkable discontinuity . . . in the neighborhood of 24°C.,” not mentioned in Professor 
Fisher's textbook illustration, which is what I was commenting on, is provided for in the last part of 
my statement. We were discussing, however, if it requires recollection, not a test for anything at all, 
not just to see whether something was the matter. We were talking about a test for “straightness of re- 


gression line,” that is a device by which we were to discern whether the regression was linear or curvi- 
linear, and a specific application of it in which the regression had been found definitely to be nonlinear. 
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homoscedasticity, (3) variable temperature as against constant tem- 
perature, and (4) other conditions which will produce a small P. I 
gave as my own preference, in view of the fact that (to me) there was 
no confirmation of the nonlinearity, that the temperature was fluctuat- 
ing. Examination of the figure suggests this explanation as especially 
likely if the temperature was variable at what was recorded as 23 or 
25°C. or both. Professor Fisher says that, if I had read Hersh’s article 
from which the data were obtained, I should not have committed the 
blunder which I made. I had read the article and it amply confirms 
my suggestions of possible alternative causes of the small P. There is 
definite evidence of heteroscedasticity in table 6 of the article, and as 
for fluctuation of temperature, here are Hersh’s words (my italics): 


Errors in this study may arise from fluctuating temperatures. The fluctua- 
tions were not the same in all cases; 15°, 17°, 25°, and 27° showed a variation 
of +0.5°. 23° varied as much as 3°. The record of 29° showed a variation from 
28.0° to 29.5°. Temperature readings taken of the food by a thermometer 
graded in tenths of degrees, showed a total variation of 1.4° in the 31° 
aquarium, and of 1.3° in the 21° aquarium. The 19° aquarium was more 
unsatisfactory and varied sometimes as much as 5°. In every case, however, an 
attempt was made to have the proper temperature during the facet reaction 
period, though this may not have been accomplished because the period may 
not be the same for the heterozygotes as for the homozygotes for which it 
has been determined. Nominal temperatures in accompanying tables and 
figures are to be considered with these remarks in mind. 














EDWARD LEWIS DODD, 1875-1943 


HE SUDDEN DEATH of Edward Lewis Dodd on January 9, 1943, at 

his home in Austin, Texas, following a heart attack, brought to a 
close a distinguished career. He had long been a member of the Amer- 
ican Statistical Association and had just been elected a Fellow of the 
Association, though the news of his election did not reach Austin until 
shortly after his death. 

Dr. Dodd was born in Cleveland, Ohio, November 1, 1875, the son 
of Heber and Emma Melvina (Drake) Dodd. He was educated in the 
Cleveland public schools; in Western Reserve University, from which 
he received the degrees of B.A. in 1897 and M.A. in 1901; and in Yale 
University, from which he received the degrees of M.A. in 1902 and 
Ph.D. in 1904. After serving as instructor in mathematics for two years 
at the State University of Iowa and one year at the University of IIli- 
nois, he came to The University of Texas as instructor in pure mathe- 
matics in 1907, and rose through the successive ranks of adjunct pro- 
fessor (1914), associate professor (1917), and professor (1923) of actu- 
arial mathematics. In the year 1922-23, he taught at Williams College 
as visiting assistant professor of mathematics. On June 15, 1910, Dr. 
Dodd was married to Miss Alice Laidman of Austin, Texas. His wife 
survives him. 

Dr. Dodd was a member of the honorary societies of Phi Beta Kappa 
and Sigma Xi and of various professional societies in addition to the 
American Statistical Association, including the American Mathemati- 
cal Society, the Mathematical Association of America, the Econometric 
Society, and the Institute of Mathematical Statistics (of which he was 
a Fellow). He was a councilor of the Mathematical Association of 
America in 1920, and Associate Editor of the American Mathematical 
Monthly from 1918 to 1923. 

His main fields of work were probability, statistics, and actuarial 
mathematics. During its early days he was a member of the Actuarial 
Advisory Committee of the Teachers Insurance and Annuity Associa- 
tion of America. He also gave much assistance in connection with the 
establishment of the Teachers’ Retirement System of Texas. 

Of the many honors which came to Dr. Dodd in recognition of his 
attainments in his chosen field, two will be mentioned here. He was 
chosen as the official representative of the Mathematical Association of 
America at the Seventh American Scientific Congress held at Mexico 
City, September 8-17, 1935. He was one of the two Americans who were 
invited to participate in the International Congress on the Theory of 
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Probability held at Geneva, Switzerland, October 11-15, 1937, and 
was the only American who actually did take part in it. 

Dr. Dodd was an inspiring teacher, a distinguished scholar, and a 
master of productive research. A list of his publications, in addition to 
numerous reviews, abstracts, and minor papers, includes some forty 
items. But to those who knew him best, he will also be remembered for 
his sincere friendliness, his unfailing courtesy, and genuine kindliness, 
and his being always the Christian gentleman. 





C. D. Simmons 
The University of Texas 


THE STUDY OF CYCLES 


HE FOUNDATION for the Study of Cycles—a non-profit organization 
cae to foster, promote, and conduct scientific research in re- 
spect to rhythmic and periodic fluctuations in any branch of science— 
announces the offering of a medal to the person who, during 1943, pub- 
lishes the book or paper in this field that in the opinion of the judges is 
the most outstanding. 

The judges are Dr. C. G. Abbot, Solar Physicist and Secretary of the 
Smithsonian Institution; Dr. Harold E. Anthony, Curator of Mammals 
and Dean of Scientific Staff, American Museum of Natural History; 
Professor W. C. Mitchell, Professor of Economics, Columbia Univer- 
sity, and Director of the National Bureau of Economic Research; Pro- 
fessor V. C. Wynne-Edwards, Professor of Zoology, McGill University; 
and Professor Ellsworth Huntington, Professor of Geography and 
Climatology, Yale University (Chairman). 

It may be in point to recite some of the reasons for assigning high 
importance to research in cyclic phenomena. First, in dealing with 
rhythm and periodicity the scientist is at the very heart of predictabil- 
ity. The power to predict accurately is the acid test of the degree of 
precision reached in any science. Medical science, for example, would 
become vastly more effective if we understood the degree to which 
various diseases are rhythmic in occurrence. Similarly, accurate long 
range weather predictions, which would be of inestimable value in agri- 
culture and aviation, depend on a knowledge of the nature and causes 
of the more or less rhythmic variations observed now and in the past. 

Second, the techniques for dealing with rhythms and periodicities 
are much the same in one branch of science as in another, but unless 
the subject is considered a field by itself, advance in methods in one 
discipline is often unknown in another until after the lapse of many 
years. 
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Third, the subject of rhythmic and periodic fluctuations is important 
because identical rhythms in unrelated phenomena and unrelated fields 
of science suggest possible interrelationships that might otherwise 
escape notice. For instance, long before the cause of tides was under- 
stood, the fact that the period of the tides is exactly half the period of 
the moon suggested that the moon must have something to do with the 
ebb and flow of the waters. 

These are some of the reasons why a concerted study of cycles as a 
whole is important. There are of course other reasons in respect to in- 
dividual branches of science. 

We must all be careful not to confuse contributions to our knowledge 
of interrelationships with contributions to the knowledge of rhythms 
and periodicities per se. There may be an excellent correlation between 
two phenomena, and yet no rhythms may be involved in either. The 
direct concern of The Foundation for the Study of Cycles is only with 
rhythms and periodicities as such. We must also keep in mind that 
denial of the existence of a rhythm may be as important as affirmation. 

In addition to awarding a medal, the Foundation will make awards 
to outstanding work in each branch of science. In these awards the 
judges will be assisted by the recommendation of an advisor appointed 
by a scientific society in the branch involved. For the American Statis- 
tical Association Dr. W. Edwards Deming, of the United States Bu- 
reau of the Census, has been designated. 

The person who receives the medal and the persons who receive the 
awards will also be elected Fellows of the Foundation for the Study of 
Cycles. 

Communications should be addressed to Professor Ellsworth Hunt- 
ington, Chairman of the Committee on Awards, Hendrie Hall, Yale 
University, New Haven, Connecticut. 


A CORRECTION 


In the memorial note of Norman John Silberling in the March 1943 
issue of this JouRNAL, page 112, an error of omission was made. Dr. 
Silberling at the time of his death was Professor of Business Research 
at Stanford University. 








BOOK REVIEWS 


GLENN E. McLAvuGHLIN 
Review Editor 


Group Differences in Urban Fertility, by Clyde V. Kiser. Baltimore: The 
Williams and Wilkins Company. 1942. xii, 284 pp. $2.50. 


This study, by a member of the Technical Staff of the Milbank Memorial 
Fund, forms an appropriate conclusion to an important series of investiga- 
tions under private auspices directed toward the discovery and description 
of group differentials in fertility in the United States. Such investigations, on 
large scale and at high scientific level, were initiated by the same Fund a 
decade and a half ago. This study is the last of its kind, not because the prob- 
lem can henceforth be neglected, but because its importance is now suffi- 
ciently demonstrated to be recognized as a matter of national interest and 
included in the agenda of the Federal Census. Private research in this field 
will henceforth be directed to related problems which are beyond the scope 
of census procedures, such as study of conditioning factors and motivation, 
and the elaboration of implications for social theory and policy. 

It is apparent from Kiser’s study and other investigations that the char- 
acter of group differentials in fertility is undergoing an important modifica- 
tion. The general pattern of these differentials is not radically different today, 
but the monotonous iteration in the findings of various investigations is be- 
ginning to disappear. This is due, in part, to the development of more re- 
fined techniques; but it is also due in part to actual change in the phenomena 
themselves. As Ogburn has pointed out, differential fertility is in part due to 
lags in the diffusion of the family limitation pattern. So long as mere cultural 
distance from experimental science dominated the whole situation, the in- 
verse relation of social status to fertility was fairly ubiquitous. In some cities, 
e.g., Boston, social gradients in fertility show little modification. This is true 
in general for all the large industrial cities in the Northeast. But in many 
parts of the country, group differentials in fertility show less uniformity and 
greater sensitivity to a variety of changing social conditions. Their accurate 
description will, therefore, require analysis of comparable data in different 
areas, under different conditions, and at different times. 

Persons not concerned with the technical aspects of demography will 
generally await the publication of the forthcoming Census report on this 
subject for a description of the present status of group differences in fertility 
in the United States. This study by Kiser, however, explores some relations 
that will not be covered by the Census report. It also illuminates, in the 
course of the investigation, many other interrelations between occupational 
status, education, income, race, nativity, and locality. 

Demographers will study this work primarily for its contributions to 
methodology. Many possible sources of bias in the interpretation of data on 
fertility and means of controlling such interpretations are fully developed 
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by the author and need no further elaboration. It seems to this reviewer that 
the limitations of marital fertility rates based on statistics of births in any 
limited period and the bias inherent in the standardization of marital fertility 
rates if interpreted as measuring total marital fertility are not sufficiently 
emphasized by the author. A standardized marital fertility rate necessarily 
eliminates the influence of differences in the span of the exposure period, 
which is one of the major factors influencing group differentials in fertility. 
Also, as the author points out, age-specific marital fertility rates, unless con- 
trolled for order of birth, are appreciably influenced by the length of the pre- 
ceding interval since marriage, and the mean length of this interval at any 
given age varies widely among different groups. Similar bias, in lesser degree, 
affects standardized frequencies of number of children ever born to married 
women. Age-specific fertility rates supply valuable information on factors af- 
fecting fertility. However, the most accurate index of total marital fertility is 
obtained from data on children ever born to ever married women in age 
classes near the end of the childbearing period, and preferably with clas- 
sification by criteria such as race, place of birth, education, etc., which remain 
constant or fairly constant through the whole childbearing period. This sug- 
gestion is emphasized here because of its possible bearing on census proce- 
dures; but it is likely to encounter some resistence from tidy statisticians who 
dislike the rejection of any available data that may seem on the surface to be 
relevant to the problem at hand. 

The predilection of statisticians for the use of all collected data and the 
impossibility of carrying through all possible breakdowns with a limited 
body of material may be responsible for the use of the ambiguous category, 
“colored.” The behavior, as regards fertility, of the Negro population is 
radically different from that of Mexicans, Chinese, and Japanese. There 
were many of the latter groups in the western cities included in the National 
Health Survey on which this study is based. Limitation of the data for non- 
white populations to those for Negroes, except where separate treatment of 
other ethnic-cultural groups was possible, would have obviated some of the 
author’s constant difficulties with this material, somewhat analogous to those 
of a scientist dealing with zoo data on the feeding habits of seals, snakes, and 
“quadrupeds.” This suggestion also has possible implications for census 
procedure. 

The treatment of pregnancy wastage, which is in general a very useful 
contribution, is handicapped, as the author points out, by the gross under- 
reporting of abortions and its possible influence on apparent group differen- 
tials. For example, the relatively high frequency of pregnancy wastage ir the 
case of women with college education in the professional group (found to be 
statistically significant) might possibly be due to fewer inhibitions in this 
group against telling the truth on this subject. 

An adequate description of differences in fertility in the United States 
requires a vast body of data specifically designed for this purpose. This is 
clearly shown by the difficulties which a skillful scientist encounters in the 
attempt to use for this purpose data incidental to another study. Dr. Kiser 
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and his predecessors and associates at the Milbank Memorial Fund are to be 
congratulated for blazing the trail, developing the necessary techniques, and 
demonstrating the possibility and significance of such studies. The Fund has 
already embarked on the still more difficult but no less important task of 
studying the conditions which determine such differentials and the possibili- 
ties of social control. 

FRANK LORIMER 


American University 


Contraception and Fertility in the Southern Appalachians, by Gilbert Wheeler 
Beebe. Baltimore: The Williams and Wilkins Company. 1942. xii, 274 pp. 
$2.50. 


When the millennium arrives with populations of optimum size maintained 
in every resource area of the world, the historians of that era should give 
due credit to the basic research in techniques and measurement of the control 
of human fertility which has been sponsored by the National Committee on 
Maternal Health and the Milbank Memorial Fund. Beebe’s book follows 
closely the type of contribution made by Stix and Notestein in Controlled 
Fertility, with some further development in techniques for measurement and 
analysis of phenomena relating to contraception and fertility, and with ade- 
quate reporting of further experimental work. 

Logan County, West Virginia, was the locale chosen for the three-year 
experiment reported. Within the county a group of women in the rural- 
nonfarm population, mostly wives of coal miners, were “enlisted” as contra- 
ceptive patients by a nurse working under supervision of local doctors. The 
technique advised was that of vaginal jelly alone with jelly and applicator 
supplied at no cost to the patient. For approximately 1,300 women who par- 
ticipated in the experiment, the chance of conception (as measured by num- 
ber of second and subsequent conceptions per 100 woman-years of exposure 
to the risk of conception) was reduced by nearly one-half, from 73 to 40 for 
whites and from 82 to 45 for Negroes. 

Such results form only the starting point of the detailed analysis presented. 
First is given an over-all review of the economic and social conditions of the 
Southern Appalachians of the sort that has been given before by regional 
sociologists and agricultural economists. Next follows a description of the 
project with particular attention to the problems of sampling involved. Like 
many other “experiments,” this one began with scant concern for the repre- 
sentativeness of the individuals included in the study, but like few others, 
before the study was concluded, every effort was made later to explore their 
representativeness of rural-nonfarm women in the county. In fact, a major 
contribution to this line of research is Beebe’s more careful consideration 
than has hitherto been evidenced of the importance of sampling procedures 
in the design of a study and of sampling errors in the drawing of conclusions 
from results. 
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The description of the group of women who participated in the experiment 
is interesting because they manifested in an exaggerated degree the traits 
common to the Southern areas of high fertility. Their mean age at first 
marriage was about seventeen and a half years, their admitted rate of pre- 
marital conceptions was 15 per cent for whites and 37 per cent for Negroes, 
their rate of first conceptions was such that 80 per cent had conceived within 
the first 10 months of marriage. From an auxiliary study which was made 
near the end of the experiment, it was found that about 32 per cent of the 
rural-nonfarm women of reproductive age in the county would accept con- 
traceptive advice, but that these women because of higher fertility were re- 
sponsible for about 50 per cent of the births of the county. 

Hence the experiment demonstrates that similar programs on similar 
populations could reduce by about 40 per cent the fertility of a group of 
women who account for about 50 per cent of the county’s births, or could 
reduce the total birth rate 20 to 25 per cent. 

The conclusions and recommendations for the establishment of effective 
programs of contraceptive services again follow the Stix and Notestein lead 
of emphasizing the advisability of providing more than one technique to 
patients. This emphasis on the acceptability of a particular method to the 
patient rather than on the efficacy of a method under laboratory conditions 
has important implications for the planning of an actual clinic program. 

MARGARET JARMAN HaGoop 

University of North Carolina 


Public Health Statistics, by Marguerite F. Hall. New York: Paul B. Hoeber, 
Inc. 1942. xxii, 408 pp. $5.50. 


In recent years there has been a virtual wave of new books on business 
statistics, social statistics, and general statistical methodology. Although the 
late Professor Pearl’s third edition of Medical Biometry and Statistics ap- 
peared in 1940, the fact remains that new books devoted to vital statistics 
and biometry have been rare. This uncrowded condition of the field itself 
will probably ensure wide attention to Dr. Hall’s Public Health Statistics. 

The author states that this book “is intended for use as a text for a basic 
course in Public Health Statistics and as reference material in health depart- 
ments.” The introductory chapter contains a conventional but good discus- 
sion of history, principles, and limitations of statistical methodology. This is 
followed by three useful chapters on principles of computation and handling 
of mass data and tabular and graphical presentation of the materials. From 
this point the book can be said to fall into two major divisions, vital statistics 
and general statistics, although not so labeled. The vital statistics section 
includes Chapters V-XIV (pp. 88-259) respectively entitled population 
data; estimates of populations; standard millions; vital events; rates; mor- 
tality rates; vital statistics other than mortality; variation and trends in 
rates; ratios and index numbers; and construction of a life table. Chapters 
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XV-XXI (pp. 260-372), devoted mainly to general statistical methodology, 
are respectively labeled mass data—preliminary definitions; measures of 
central tendency; measures of variability; measures of relationship and trend; 
theory of probability, point binomial, normal curve; measures of reliability, 
and prerequisites to the application of statistical methods. 

The author probably had good reasons for the tandem arrangement of the 
materials for vital statistics and those for general statistics, but the rewiewer 
fails to see what they were. If separation was necessary it would seem that 
the chapters on vital statistics should follow rather than precede those on 
general statistics. In the reviewer’s opinion, however, better integration of 
the materials would be in order for a book on public health statistics. As one 
might expect, the present arrangement resulted in virtual duplication of the 
discussion in several instances. 

The author avowed a conscious effort to prepare a book that could be used 
by people with little training in mathematics. With commendable simplicity 
of language and detail she leads the reader through the step-by-step proce- 
dures in specific computations. Dr. Halbert L. Dunn correctly states in the 
foreword, “She has discovered the knack of making probabilities, chi-square 
rates and index numbers into everyday county, city, and state health lan- 
guage.” This characteristic will doubtless enhance the value of the volume as 
a handbook for use in health departments. 

In the reviewer’s opinion, however, the volume does not measure up so 
well as a textbook. For this purpose one would look for adequate discussion 
of underlying theory as well as for clarity in the exposition of technique. 
Unfortunately, the discussion of theory tends to be sketchy. It is possible 
that in writing the book the needs of the health department rather than those 
of the classroom were uppermost in the author’s mind. Whatever the ex- 
planation, the author tended to devote much more attention to the mechanics 
of computation than to the meaning of the statistical measures considered. 
On several occasions the reviewer was somehow reminded of the lines, “Theirs 
not to reason why, Theirs but to do and die.” 

Despite the above characteristics there is ample evidence of the author’s 
own well-balanced point of view regarding the place of statistical technique 
in public health work. She emphasizes strongly the limitations of statistical 
methodology and the necessity of knowing at first hand the characteristics 
of the basic data and the conditions under which they were collected. 

CiypeE V. KIsER 


Milbank Memorial Fund 


Refugee Settlement in the Dominican Republic, A Survey Conducted under the 
Auspices of the Brookings Institution. Washington: The Brookings Insti- 
tution. 1942. xvi, 410 pp. $4.00. 


In 1938 at an intergovernmental conference on the refugee problem, the 
Dominican Republic, alone among thirty-two nations, declared its willingness 
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to welcome large numbers of refugee settlers. The limit was set at 100,000. 
This offer, the one bright spot in a gloomy picture, speedily led to the forma- 
tion of the Dorsa, the Dominican Republic Settlement Association, a social 
agency financed from private funds, and to the establishment on the north 
coast of the Dominican Republic of the first refugee colony of under 500 
persons at Sosua, a 26,000-acre estate offered to the Association by Generalis- 
simo, now again President Trujillo. 

This survey is limited to the activities and prospects of the Dorsa; it does 
not extend to other refugees, among whom Spanish loyalists are particularly 
numerous. Moreover, the survey confines itself strictly to economic aspects 
of the problem, omitting social, political, and other non-economic phases. 
Nevertheless, this book is far more than the study of a single local settle- 
ment. It is the most comprehensive study of the economic life of the people 
of the Dominican Republic known to the reviewer and as such constitutes a 
worthy companion volume to the Brookings Institution survey Porto Rico 
and Its Problems, 1930. The success or failure of the Dorsa and its protegés 
is inextricably tied up with the economic life of the Republic. 

This economic survey of the Dominican Republic makes up the bulk of the 
study. It is preceded by three chapters dealing with the refugee problem in 
general and is followed by four chapters devoted to the work and problems 
of the Dorsa in particular. 

The principal questions which the survey had to consider were: (1) What 
is the relation of the Dominican project to the refugee problem as a whole? 
(2) Is the Dominican Republic a suitable place for refugee settlement? 
(3) How may refugee immigrants make a living there? (4) How many settlers 
can the Republic absorb? (5) What will be the effect of the settlement project 
on the Dominican community? 

Of these five questions the third and fourth are of most immediate and 
practical importance. The answers given are none too encouraging. Refugee 
immigrants may aspire to the level of peasants, i.e., agriculturists devoting 
most of their energies to subsistence farming with cash crop farming or 
primitive industry definitely supplementary and secondary. The Republic 
can absorb according to the majority of the survey staff from 3,500 to 5,000 
persons and, according to one member, up to 10,000 refugee settlers. 

Those familiar with the desperate conditions prevailing throughout the 
Caribbean area, the answer to question three can hardly be surprising. But 
the low absorption capacity of the Dominican Republic does come as a shock 
to those who compare the Dominican rural population density of 68 per 
square mile with that of Puerto Rico, Haiti, and Jamaica, which in the ag- 
gregate is over four times as high. The low arability coefficient, the fantastic 
rate of increase of the population, and the land requirements of the refugee 
peasants, which far exceed those of the native rural proletariat, offer the 
explanation. 

It will cost not much less than $4,000 to settle a refugee on the island and 
of that he is expected to pay back about $1,600. At that rate the settlement 
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of 5,000 “units” would require an outside net contribution of roughly 12 
million dollars. 

Apart from the chapter on the history of the Republic which reads like a 
dime novel, the book is written in a sober, almost sombre tone. It is an in- 
valuable study of a local problem with global implications. It is required 
reading for all those wrestling with the problems of the post-war world. 

The map of the Island of Hispaniola facing the title page might have been 
enhanced in usefulness by the insertion of names prominently mentioned in 
the text. The absence of an index is not a serious flaw in a survey carefully 
outlined and skillfully articulated. 

Ericu W. ZIMMERMANN 

The University of Texas 


Consumer Instalment Credit and Economic Fluctuations, by Gottfried Haber- 
ler. New York: National Bureau of Economic Research. 1942. xix, 239 pp. 
$2.50. 


This volume is the ninth of the studies in consumer instalment financing 
prepared under the financial research program of the National Bureau of 
Economic Research. In it data gathered in the previous studies are used as 
the basis for a theoretical analysis of the relations between consumer instal- 
ment credit and cyclical fluctuations in general business. 

Consumer instalment credit both in the form of commodity credit and 
cash loans shows a pronounced cyclical pattern when measured by any one 
of the series used for this purpose, volume of new credit granted, amount of 
credit outstanding and “net credit change.” The causes of fluctuations in 
instalment credit are found in both long run growth factors as well as cyclical 
factors. Cyclical fluctuations in consumer credit are caused more by cyclical 
shifts in demand than by cyclical shifts in supply. Responsiveness of demand 
to changes in the terms of credit is more dependent upon changes in the down 
payment percentage and contract length than upon changes in the rate of 
interest charged. 

The introduction and expansion of consumer instalment credit in the 
1920’s had a stimulating effect upon the economy in the short run. Whether 
this development is to be regarded as favorable or unfavorable for economic 
stability in the long run depends upon questions of business cycle theory on 
which expert opinion is still divided. Haberler’s conclusions are as follows. 
After the introduction and development to maturity of instalment credit as 
an institution it is still subject to cyclical fluctuations but these fluctuations 
are then the result rather than the causes of cycles in general business activ- 
ity. However, consumer credit does act as an amplifier by intensifying the 
movements of the general cycle whether up or down. Yet this behavior is in 
no way different from that of credit in general. 

In comparison with other influences upon general business activity con- 
sumer instalment credit was not very important during the decade 1929-40, 
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much less important, for example, than the financial operations of the Federal 
Government. Yet consumer credit has a greater significance than this com- 
parison would suggest when it is considered in relation to the development of 
particular industries and their effects upon the economy as a whole. Con- 
sumer credit as an institution should be maintained, but efforts should be 
made either to mitigate its cyclical fluctuations or to bring about an anti- 
cyclical pattern. 

The publication of this book is timely because of the new regulations of 
consumer credit of the Federal Reserve System in the present emergency and 
the possibility that such regulations may be carried over into the post-war 
period. 

James F. Cusick 

Dartmouth College 


Farm-Mortgage Credit Facilities in the United States, by Donald C. Horton, 
Harald C. Larsen and Norman J. Wall. Washington, D. C.: United States 
Government Printing Office. Miscellaneous Publications No. 478. United 
States Department of Agriculture. 1942. vii, 262 pp. 55 cents. 


A more appropriate name for this publication would have been Farm- 
Mortgage Credit Facilities of the United States, Why they should be in- 
creased and How they can be increased. Nine of the sixteen chapters deal 
exclusively with federal credit agencies and the remainder consider the more 
general problems of debts, interest rates, etc., with but little mention, let 
alone description or analysis, of other farm-mortgage credit facilities. Yet, 
the following statement expresses a point of view which permeates the entire 
discussion. 


But the public purposes to be achieved through extensive participation of 
the Government in farm-mortgage financing may be so essential as to make 
the disadvantages negligible. It is possible that the results expected of a co- 
ordinated Federal mortgage-credit service are such that the profit motive 
of private financing alone cannot be expected to accomplish them. For 
example, public policy operating through Federal mortgage-credit channels 
might be made to serve as a beneficial form of public action designed, along 
with other action programs, to assist agriculture in adapting to changing 
economic conditions. Considerations of this character may justify the 
sacrifice of certain of the advantages to farmers of private credit facilities. 
(Page 170.) 


The above can hardly be called a conclusion because it is not based on any 
evidence in the text. It is not stated as an assumption on the basis of which 
further argument is advanced. It does, however, express a philosophy which 
is evidenced in numerous parts of this work. The reviewer has no wish to 
quarrel with the authors for this point of view. He does wish to point out, 
however, that evidence is not presented to support it other than evidence 
which indicates that the Federal Government did expand its credit facilities 
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in the depths of depression. The fear expressed is that unless these agencies 
adapt themselves to present economic conditions they may not be available 
to serve should similar catastrophies arise in the future. Then, of course, the 
happy thought occurs that such a calamity may be avoided if “wise” credit 
policies are followed. 

The authors maintain that “if the Federal agencies are to be major sources 
of farm-mortgage credit in periods of prosperity as well as in depression, it 
may be necessary to modify these institutions further so they can serve not 
only as refinancing and financial-adjustment agencies but also as continuous 
sources of both purchase-money and operating credit.” 

Though the authors may seem to the reader to be advocates of “socialized 
credit” they do advocate the sound policy of modifying by improvements 
existing Federal farm-mortgage credit facilities to meet the varied needs of 
agriculture under varied conditions. As a basis on which such modifications 
may be made there is presented a wealth of information, tabulated in a very 
orderly manner and unusually well illustrated. 

Part I discusses the historical background of the farm-mortgage credit 
situation for the period 1910 to 1940. Though the general theme is much the 
same heard so many times, particularly during the last decade, its concise 
recording with suitable statistics is a useful adjunct to the text. 

Part II is particularly good in its description of Federally sponsored farm- 
mortgage loan agencies. The organization and development of each agency 
is described and its source of funds, terms and conditions of loans explained. 

The first two parts carry out very well the intentions of the authors, 
namely “to serve a principal purpose of providing a compact summary and 
analysis of factual material on farm-mortgage credit heretofore available 
only in scattered sources.” 

Part III is concerned with current problems and issues in farm-mortgage 
credit policy but as indicated in the introduction “more emphasis is placed 
upon identification of problems and issues, and upon exploration of their 
interrelationships in the light of alternative public objectives than upon 
formulation of specific recommendations.” 

The publication merits careful study by all borrowers and prospective 
borrowers and by lenders, both public and private, at this particular time 
when foundations are being built for the economic structure within which we 
must all live for the next decade or two. 





JouN B. SCHNEIDER 
University of California, Berkeley 


Business Behavior 1919-1922, by Wilson F. Payne. Chicago: The University 
of Chicago Press. 1942. xi, 215 pp. $1.50. 


This book must be regarded as timely, since many people are now in- 
terested in accounts of past experiences of the postwar development of busi- 
ness which may be used for the purpose of forecasting the future develop- 
ment of business after the present war. The approach of the author to his 
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subject is more than empirical; it may be qualified as “atheoretical.” In the 
introductory chapter, he complains of too many contradictory theories of 
business fluctuations and points to the danger of selecting some one hypoth- 
esis as a basis for analysis of data, since this selection of itself determines 
what kind of data is considered as important in such an analysis. 

The author follows an industry approach to his subject. Several industries 
were selected for survey, most of which are divided into a number of integral 
segments. He analyzes leather and shoes, textiles, iron and steel and their 
principal users, automobiles, and building and allied industries. The last 
group is not included for close examination, but simply to complete the gen- 
eral picture of the period. These surveys are preceded by a short analysis of 
the financial situation during the period under study. The author recognizes 
that this aspect of business development during 1919-22 has received keen 
examination by other observers, but he found it advisable to review it in 
order to provide necessary points of reference. 

The author thoroughly worked numerous statistical data, which are sum- 
marized graphically in eleven charts, each including some 15 to 20 curves. 
These charts represent the behavior of the various indexes characteristic of 
the industries surveyed in the book. But they are not interpreted in the text. 
This may be regarded as a shortcoming from the point of view of the general 
reader, since comparisons and interpretations of these charts are not easy 
problems. Only a specialist may do so advantageously. In the opinion of the 
reviewer, this omission, which presumably was done for economy of space, 
may be partly responsible for the fact that Payne’s book does not leave the 
reader with an impression of the behavior of business as a whole. Although 
the interactions between segments of individual industries are sufficiently 
analyzed in separate chapters, little attention is given to a systematic 
analysis of the interactions between the major branches of industry during 
the 1919-22 cycle. 

The last chapter gives a recapitulation of developments, but it occupies 
only nine pages, and its conclusions do not utilize fully the rich material 
supplied in the separate chapters. The lack of sufficient analysis of inter- 
connections may result from the atheoretical approach of the author. He 
purposely avoids using any of the analytical tools elaborated by the theory 
of the business cycle, as if he is afraid of falling in the tenets of some one 
theory. But his statement that the list of industries selected for analysis “is 
confined to a representation of those of which short industrial depressions 
are peculiarly composed” presupposes some theoretical conception of the 
business cycle, which he does not formulate explicitly. By excluding the food 
industries and agriculture, he disposes of at least one group of the theories of 
the business cycle. The exclusion of agriculture is not consistent, however, 
since in his analysis of the textile industries he was obliged to pay much 
attention to variations of cotton crops and his analysis shows how important 
the quantitative and qualitative characteristics of these crops during the 
years under study were for the explanation of the business situation in the 
cotton industry. 
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The extreme empiricism of the author, however, may be justified by the 
unsatisfactory state of the theory of the business cycle. In its present situa- 
tion, the theory may be regarded more of a barrier than a help to a theoretico- 
empirical approach of the subject, and may contribute to extreme empiricism 
of even those investigators who are conscious of the dangers of extreme em- 
piricism, as is the author. 

The book may be used advantageously in advanced courses on the busi- 
ness cycle, particularly in special seminars, though the general reader will 
hardly get a general picture of the business behavior during 1919-22. Those 
interested in the behaviors of separate industries during that period will find 
the book interesting and useful reading. 





Y. P. TImMOSHENKO 


Food Research Institute 
Stanford University 


Commodity Statistics, The 1942 Commodity Year Book. New York: Com- 
modity Research Bureau. 1942. 294 pp. $5.00, and Commodity Year Book. 
New York: Commodity Year Book. New York: Commodity Research 
Bureau, Inc. 1942. 413 pp. $7.50. 


The volume on Commodity Statistics is the fourth of an annual statistical 
series. Descriptive materials on the background, processing, and marketing 
of commodities have been omitted from the 1942 volume. Such information 
covering all the commodities listed in the statistical volume and in addition 
virtually all of the important crude and semi-finished products used in the 
national economy are presented in the Master Edition, Commodity Year Book. 
The latter, therefore, is a commodity encyclopaedia designed for permanent 
reference use. 

Commodity Statistics presents data in 61 individual or group sections for 
the 75 important crude and semi-finished materials covered in previous 
statistical editions. Statistics reported are for commodities generally classified 
as animal and animal products, beverages, forest products, fuels and electric 
power, metals, nonmetallic minerals, vegetable foodstuffs, vegetable oils and 
fibers, and such specialties as rubber, silk, and tobacco. 

Data presented in each section vary in length from 1 or 2 pages to 16 pages 
depending on available information and the importance of the commodities. 
The sections generally comprise statistics on consumption or sales, produc- 
tion, prices, exports and imports, and stocks. Frequently statistics are given 
on such subjects as yields, taxes, kinds of uses, and sizes. All the statistical 
tables are simply arranged and easy to read. Footnotes are given but in some 
tables, particularly those involving estimates and comparability of a series 
of data, technicians may desire more detailed explanations. 

The data have been compiled from a variety of recognized sources such as 
Government agencies, Boards of Trade, commodity exchange bureaus, and 
other institutions. Source references to the pertinent agencies and institu- 
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tions are printed at the foot of each table. This source information could be 
more helpful to statistical workers if it also included references to the names 
of the current publications of the particular agencies. Technicians using the 
volume may find it necessary to look up available current data. 

Information in the 1942 Year Book has been somewhat clouded by war 
conditions. Only data that have been officially released by governmental and 
quasi-official sources as not being detrimental to the war effort have been 
published. Consequently, statistics for strategic and critical commodities 
generally are not shown after the first 9 to 11 months of 1941. 

The Master Edition is composed of separate articles on over 800 different 
crude and semifinished commodities. These articles contain brief descriptions 
of available information on such subjects as physical characteristics, methods 
of production, marketing and transportation methods, comparative prices 
over recent years, principal types and grades, possible substitutes, and 
government regulations. An attempt was made to use the same general out- 
line for all commodity articles, but variations are frequent because such data 
for many commodities are lacking in part or otherwise do not warrant such 
specific treatment. Amount of material presented for each commodity varies 
from part of one page for the less used commodities up to 16 pages for a 
group of commodities such as plastics. Notable features of this volume are 
its brief historical and economic reports on rubber, plastics, and many com- 
mon and rare chemical and vegetable commodities. 

The two volumes should be valuable to statisticians, investigators, and 
research workers, because extensive statistical and informational data for so 
many important commodities are to be found in one set of reference books. 
It is hoped that the editors will find it possible and necessary to expand the 
coverage of their statistical volume to include some data for all commodities 
described in the Master Edition. 

Oscar L. ENDLER 

National Resources Planning Board 


Make or Buy, by James W. Culliton. Boston: Harvard University, Bureau 
of Business Research. 1942. iv, 130 pp. $1.50. 


Every business purchase, the author points out, involves at least a tacit 
decision that it is more desirable to buy the article in question from an out- 
side supplier than to produce it within the organization. Conversely, every 
production schedule indicates at least a tacit decision that each item in- 
volved can more satisfactorily be made by the company than purchased out- 
side. 

While there are limits to the field of such choices, the author believes that 
too often a decision to make or buy a particular item, regardless of its original 
merit, continues to stand as an established policy long after changing con- 
ditions call for further review. Obviously, few companies will find it economi- 
cal to make their own lead pencils or stationery; a specialty manufacturer 
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will not seek to put himself out of business by buying the finished articles 
which provide his primary manufacturing revenue. Between such limits, 
however, lies a wide range of optional items which could be either made or 
bought. 

Cost will be the principal factor determining such a decision, the author 
emphasizes. He advocates no one method of determining costs, pointing out 
that the facts will vary from case to case in the matter of overhead involved 
in producing a particular part, in the length of tool life, and in optional uses 
of existing facilities. Savings or increases in purchasing expense and storage 
and handling costs may or may not be capable of determination and alloca- 
tion for individual items. In the analysis of a number of case studies, the 
author points out that all costs that can be clearly allocated should be, and 
that the decisions reached in some cases might have been reversed had a dif- 
ferent analysis of cost been made. The final decision, even where cost is con- 
cerned, depends to a large extent upon the exercise of business judgment. 

Further factors which will influence “make or buy” decisions, particularly 
where the cost element is not decisive, are control over quality, control over 
scheduling of production, and risks beyond the company’s control. The 
author questions whether these factors deserve as much weight in decisions 
not to buy as they often receive, and cites cases to support his position. 

Readers in search of clear-cut generalizations as to the preferability of 
making or buying will be disappointed. The analysis and the wealth of case 
material from a variety of manufacturing operations will, however, make the 
book of interest to individuals reviewing their own company policies. With 
many companies facing postwar reconversion and a fresh start in their regular 
peacetime operations, the book is particularly timely. 

CuHarues E. Youne 

Westinghouse Electric and Manufacturing Company 


Wool and the War, by Hiram 8S. Davis. Philadelphia: University of Pennsyi- 
vania, Wharton School of Finance and Commerce, Research Report No. 
6. 1942. xi, 107 pp. 50 cents. 


This publication presents a clear picture of our war supplies problems for 
wool textiles. The author carefully weighs the wool supply prospects com- 
pared with probable needs for the war period. The report analyzes stock on 
hand and prospective production and importation of raw wool, prospective 
production of substitute materials, and the country’s capacity for producing 
wool textiles as measured by mill facilities and trained labor. Wool needs are 
appraised in terms of prospective military and civilian requirements through 
1944. 

The author concludes that, although our present and prospective supply of 
raw wool may not be enough to support civilian consumption at its peace- 
time rate without encroaching upon military reserves, there is no foreseeable 
shortage of substitute fibers ahead. Continuance of the use of raw wool for 
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civilian products at its current restricted level (the conservation order of 
the WPB) has limited civilian use to about 44 per cent of annual peace-time 
consumption, and the difference between this volume and usual peace-time 
consumption the author contends can easily be made up by using in apparel 
production more of the wastes from wool processing, more wool fiber re- 
covered from tailor’s clips and old rags, more hair fibers, and finally more 
blends of wool and rayon or cotton, or of cotton-warp wool goods. 

The author warns that this plentiful supply of substitutes should not mean 
that consumers can expect to buy wool textile products at the high level of 
1942 without sooner or later creating serious shortages. This warning note 
seems very appropriate in view of the prospective increase in purchasing 
power in 1943 and 1944, coupled with the expanded rationing program of 
non-clothing items which concentrates more consumer buying pressure on 
clothing items. 

The report concludes that the maximum military and civilian demand for 
wool textile products, which now seems in prospect, could be met from exist- 
ing mill facilities if they were operated on a three-shift rather than on a two- 
shift basis. Even with two shifts of 40 hours per week, however, the report 
concludes that the industry could handle a military business equivalent to the 
total of 1942 and a civilian output of apparel, blankets, and woven felts very 
near the level of 1939. 

According to the author both materials and productive facilities should be 
adequate for prospective military and civilian demand. It should be re- 
membered, however, that this report was written during the first half of 1942 
and events which have occurred since that time have changed significantly 
some of the prospects particularly on the demand side. For example, the 
author bases his estimates of American supply on a maximum of five million 
men in the armed forces at the end of 1942, seven million by the end of 1943, 
and nine million by the end of 1944. Most recent estimates indicate at least 
eleven and a half million will be in the armed forces by the beginning of 1944, 
or shortly thereafter. This increase in demand, coupled with continued heavy 
buying of wool textiles, makes the wool supply picture less bright. Partly 
offsetting this greater demand situation is the record 1942 wool production 
of 459,073,000 pounds, which is an increase of nearly sixty-nine million 
pounds over the 1941 output. However, the author concludes that “the sheep 
expansion program does not offer significant results either from the point of 
view of the immediate emergency or from that of nationalism.” In support 
of this conclusion, the author cites the failure of higher wool prices during 
World War I to result in any appreciable increase in sheep numbers. Despite 
the tripling of prices, the domestic clip in 1919 was only five per cent above 
that of 1914, and the wool clip would have shown an actual decline had it 
not been for expansion in Texas production. It is significant that sheep num- 
bers in the United States at the beginning of 1943 are about three per cent 
below those a year earlier. 

The report is well buttressed with ample supporting factual information 
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and a thorough knowledge of the wool textile industry. Moreover, the author 
is aware of developing events because he ends the report with a warning note 
that rationing wool textiles may be necessary not because wool textile sup- 
plies may fall short of ordinary needs, but because unspent purchasing power 
may push demand up to extraordinary levels. Also he points out appro- 
priately that neither the discouragement nor rationing of consumer purchases 
really solves the problem of existing purchasing power, and that some steps 
must be taken to remove at least a portion of existing consumer purchasing 
power from the markets. Heavier taxation, enforced savings, and related 
fiscal controls are essential if purchasing power is to be kept down to a point 
where consumer sales will not create severe shortages of most wool textile 
materials. The implications of these requirements on national fiscal policy 
and the OPA rationing program should be obvious. 





R. R. RENNE 


Montana State College and 
Office of Price Administration 


Wartime Rationing and Consumption. New York: Columbia University 
Press. League of Nations Publication. 1942. II. A. 2. 87 pp. $1.00. 


As its preface states, this short book “is concerned mainly with the impact 
of war controls and rationing on consumption and standards of living in 
countries for which information is available.” 

Its first brief chapter treats of methods and principles of consumption 
control and of direct consumer rationing. An important observation is that 
wartime governmental rationing systems ‘“... contain the elements of a 
distributive system in which consumption is guided not so much by individ- 
ual purchasing power as by human wants.” It will be interesting to see 
whether postwar developments nurse or strangle this growing plant. 

The long second chapter presents statistical evidence of food rations in 
many countries as of April 1942. For understandable reasons the sources of 
data cannot be given, and therefore the results cannot be checked by re- 
viewers. The wartime history of changes in rations is touched briefly, and 
comparisons are drawn between the 1942 rations and prewar consumption. 
The value of this compilation and comparison, which represents immense 
labor, goes almost without saying; it constitutes the most comprehensive 
survey presently available. Its value is not vitiated by the following com- 
ments: Rumanian (Bucharest) rations of sugar, meat, and fats look incredi- 
bly high. The conclusion that Italy ranks with France, Belgium, Norway, 
Finland, and Spain as having rations so abnormally low as to imply preva- 
lence of both undernourishment and malnutrition is difficult for the reviewer 
to reconcile with such advices as seep out of the country. The section on 
“Worker’s Family Consumption” carries little conviction so far as concerns 
comparisons of rations with prewar calorie consumption as shown by budget 
surveys, because the range of calorie consumption indicated by the surveys 
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was incredibly large. Acceptance of these surveys, and of various nutritional 
standards advanced by the League of Nations, seems somewhat uncritical. 

The very brief third chapter gives a worldwide survey of wartime ration- 
ing of clothing, housing and household essentials, miscellaneous necessities 
and services, and non-essentials. The final chapter seeks to measure the ef- 
fects of wartime spending restrictions upon the aggregate volume of con- 
sumption in seven countries, especially the United Kingdom, Germany, and 
the United States. 

Readers interested in finding a guide for American policy may question 
the emphasis upon the desirability of equity of food distribution in a full- 
fledged war economy. They may also feel a lack of discussion of the labor 
costs of consumer rationing, of the impact of rationing on the distribution 
system, of the circumstances of general or only of particularized food short- 
ages that justify food rationing, and of the difficulties of holding farmers 
within food-ration allotments. 

The subject matter—principles and practices alike—lends itself to con- 
tinuing inquiry and development. It is to be hoped that supplementary pub- 
lications will appear. 

M. K. BenNETT 

Food Research Institute 

Stanford University 


Statistical Methods Applied to Agricultural Economics, by Frank A. Pearson 
and Kenneth R. Bennett. New York: John Wiley & Sons, Inc. 1942. vii, 
443 pp. $4.00. 

This is a valuable book for the practical agricultural economist. It com- 
bines in a single volume nearly everything of a practical nature which the 
average student of agricultural economics needs to know about statistical 
method. The authors have illustrated the various statistical techniques with 
numerous examples taken from practical problems largely in the fields of farm 
management, marketing, and prices. Statistical theory is subordinated to 
its practical application to real problems, but the student should easily get 
the theory from study of the specific examples to which it is applied. The 
language of the text is fairly non-mathematical. The interest and attention 
of the agricultural economist reader will be held by the wide variety of well 
chosen practical problems which illustrate the statistical procedures” dis- 
cussed. 

The general arrangement of the book follows the usual order of statistics 
textbooks: measures of central tendency, variation, association, and reliabil- 
ity. It is divided into 22 chapters. Nearly every chapter has a very helpful 
section near its end entitled “Uses,” and/or “Comparison,” in which the 
authors summarize in a non-mathematical and practical manner the meat of 
the chapter. 

Two excellently prepared chapters on the tabular analysis of relationships 
cover a field which has been neglected entirely too long in agricultural statis- 
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tics books. This method has been and will continue to be the most widely 
used method of analyzing relationships. Perhaps its general usage and ap- 
parent simplicity have prompted statistical writers to dismiss it lightly. Yet 
it is well for the student of agricultural economics to know its uses and limita- 
tions. The manner in which the authors develop the treatment of tabular 
analysis provides an easily understood introduction to the theory of correla- 
tion in the following chapter. 

Regression and correlation are treated in a more understandable manner 
than is true in many textbooks. There is considerable material of a practical 
nature in these chapters that is frequently omitted from books of this kind. 

Some readers may feel that the authors have placed too much emphasis 
upon the analysis of data already collected, and too little emphasis upon the 
theory of sampling and the planning and collection of an adequate and repre- 
sentative sample. Agricultural economists too frequently follow some varia- 
tion of the “hit or miss” method in the collection of sample research data, and 
then go to great pains to be “accurate” in its summarization. It is essential 
that agricultural economics students and researchers alike be made increas- 
ingly aware of the importance of proper sampling technique. 

Other readers may notice the omission of an introductory chapter explain- 
ing the general relationship of statistics to science and research. Statistics 
are a very useful tool, and it may seem that any statistics textbook should 
attempt to orient the beginning student in an appreciation of the contribu- 
tion that statistics can and should make to the advancement of knowledge. 

E. L. Butz 





Purdue University 


Fundamental Statistics in Psychology and Education, by J. P. Guilford. New 
York: McGraw-Hill Book Company, Inc. 1942. xi, 333 pp. $3.25. 


The intention of the author in preparing this book is to present the basic 
principles and methods of statistics to the beginner in the subject. The treat- 
ment is essentially non-mathematical in character, although mathematical 
symbolism is freely used. 

No attention is given to methods of collection and arrangement of data. 
The major emphasis is upon methods of analysis and the interpretation of the 
results of analysis. Early chapters deal adequately with measures of fre- 
quency distributions, and the properties of the normal curve. The remainder 
of the book is devoted to the problems of sampling, measures of reliability 
of samples, and correlation procedures. 

Perhaps the outstanding feature of the book is the uniform clarity of pres- 
entation. There are certain points at which the reviewer feels that the fetish 
of brevity may have been pursued at the expense of clarity and completeness, 
but such lapses are negligible in comparison with the overall precision of 
expression and adequacy of treatment. 

This matter of brevity is most prominent in the last two chapters of the 
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book. The chapter on multiple and partial correlation would be more satisfy- 
ing if an explanation were included of the meaning of multiple and partial 
regression equations. Some explanation is also needed of why the successive 
steps are taken in the Doolittle Method of solution. The chapter on reliability 
and validity of tests is an excellent critical discussion of the theoretical prob- 
lems involved in evaluating test results. It seems to imply more practical 
experience with tests than the beginning statistical student would ordinarily 
be presumed to possess. 

At the end of each chapter a set of exercises is appended. These exercises 
are well designed to reinforce the student’s understanding of the principles 
set forth in the chapters, and are adequate for homework assignment or for 
a laboratory of moderate length. 

The attitude of students toward courses in statistics has in many cases 
become extremely adverse because the authors of textbooks have succeeded 
only in introducing mystery into the subject. This book takes the mystery 
out of the subject by its clear and rounded exposition, its judicious choice of 
illustrative examples, and its general adherence to the point of view of the 
beginning student. The book is indeed a valuable teaching medium. 

MartTIn A. BRUMBAUGH 
University of Buffalo 


An Outline of Probability and Its Uses, by Maurice C. Holmes. Minneapolis: 

Burgess Publishing Company. 1936. viii, 119 pp. $1.50. 

Four decades of experience in a great industrial organization justify this 
reviewer in stating that the Outline by Professor Holmes should be in the 
hands of every engineer and statistician unfamiliar with the theory of prob- 
ability. Moreover, the Outline offers an excellent résumé for one already 
acquainted with the subject 

Commendation is called for by the author’s choice of material and the 
order of its presentation. The diagrams throughout the text will appeal 
particularly to the engineer. 

In this time of stress, when our thoughts should be concentrated on win- 
ning the war, the above should suffice as a review. However, the following 
comments are added in anticipation of a second and other editions, and as 
proof that the reviewer has read the first edition from a to z. 

1. On reading page 7, where mutually exclusive events are dealt with, the 
novice may interpret “the probability that at least one will happen” as mean- 
ing that “not less than one will happen.” It might be clearer to say “some 
one of the events” or “some one or other of the events will happen.” 

2. On page 10, no doubt the author is historically correct in his statement 
that equation (9) is “called the binomial law from the fact that the term C,” 
is equal to the coefficient of the nth power of z in the binomial expansion of 
(x+y)™.” In our industry we apply the name “binomial law” to equation (9) 
because its entire right hand term is a term in the expansion of the binomial 
(p+q)™, wherein g=1 —p. 

















AMERICAN STATISTICAL ASSOCIATION: 





268 


3. On page 99, under the caption “Symbols,” the equation 
a+b =“a or b” 
might better be written 
a+b =“a or b or both.” 

One unfamiliar with symbolic logic may interpret “a or b” as meaning “a 
only or b only.” 

The above comments in no way reflect on the merits of the Outline. The 
exposition goes straight to the point without any lost motion. Moreover, it is 
entirely free from facetious phrases, a type of phrase which disfigures many 


otherwise excellent works. 
E. C. Mouina 


Seven-Place Values of Trigonometric Functions, compiled by J. Peters. New 
York: D. Van Nostrand Company, Inc. 1942. $7.50. 


Those acquainted with the well known tables Siebenstellige Werte der 
trigonometrischen Funktionen compiled by Dr. J. Peters of the Kgl. Astrono- 
mische Rechen-Institut, Berlin, and published in 1918, will welcome this 
photo-offset of those useful tables together with the English translation of the 
Introduction. The Alien Property Custodian has undoubtedly performed a 
valuable service in making these tables generally available at this time. 

The need for increasing the speed of computations and the increased use 
of electric calculating machines in astronomical, geodetic, and optical calcu- 
lations has emphasized the value of accurate natural trigonometric functions. 
These tables admirably fill that need. The use of decimal parts of degrees is 
logical in simplified calculations. Since these tables are photo-offsets they 
will not be subject to typographical errors as new type-set tables would be. 
This reviewer has used the original tables with the electric calculator for 
over a decade finding them more satisfactory than any others currently 
available. 

These tables which give the sine and tangent (cosine and cotangent) in 
thousandths of a degree and the value of the function to seven places are 
based upon those of Gellibrand’s Trigonometria Britannica, (Goudae 1633) 
which were computed for every hundredth of a degree of the quadrant. The 
functions in the present tables were interpolated for the thousandth of a 
degree, except for the tangents of angles from 89° to 90° where the values were 
calculated. The possible maximum error is given as +0.65 unit in the 
seventh decimal place of the sine function and as +0.85 unit in the seventh 
decimal of the tangent function. In general the errors are very much less 
than these and interpolations may be made for a ten-thousandth of a degree 
with considerable accuracy. Conversion tables of minutes and seconds to 
decimal parts of degrees and of degrees to time (and vice versa) are given at 
the end of the book. The tables are arranged for convenient use. 

KENNETH N. OGLE 
Dartmouth Eye Institute 
Hanover, New Hampshire 
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